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Preface 


This book on extracellular matrix (ECM) proteins is the result of appreciation 
and awe for matrix biology and structural proteins. These proteins are emerging 
as much more than passive bystanders to the fascinating life, death, and fate of 
cells: they control these cells. 

Many researchers and their important work have been cited in this book; 
however, not all, and not all who deserve to be cited are included. Thus, for all 
who are working on collagens, laminins, and elastin, please send your refer- 
ences and a summary of your work to be included in future editions of this 
book. The aspiration of these books on the ECM is to be as complete as possible 
regarding research on collagen biomarkers and their biology. Please contribute 
to this ongoing aspiration. 

The functions of many collagens still remain to be discovered and presented, 
both with respect to their physiological and pathophysiological roles. The hope 
of this book, and subsequent books, is to inspire new researchers to take the col- 
lagen challenge and present novel research and biology that are important for 
understanding the role of the ECM in pathological and physiological conditions. 


Sincerely 


Morten A. Karsdal, MSc, PhD, mBMA 
Professor, University of Southern Denmark 
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Nordic Bioscience, Herlev, Denmark; ?Southern Danish University, Odense, Denmark 


The backbone of tissues is composed of structural proteins such as collagens, 
laminins, and elastin. During tissue turnover, these proteins are formed and 
degraded in a tight equilibrium to ensure tissue health and homeostasis. Imbal- 
ances in these processes can result in fibrosis. Fibrosis can affect almost any 
organ or tissue. The core protein of fibrosis is collagen and other structural pro- 
teins such as laminins and elastin. Collagens are not simply structural in role: 
each has a unique expression pattern and some have key signaling functions in 
addition to their structural functions. The common denominator for collagens is 
the triple-helix structure, which is less pronounced in laminins. Collagens are 
divided into several distinct subgroups of which the fibrillar and networking 
collagens are the most investigated. This chapter introduces the superstructure 
of collagens, laminins, and elastin as well as key features of collagen biology, 
expression, and function. 


WHY ARE COLLAGENS AND STRUCTURAL PROTEINS 
IMPORTANT? 


Fibrosis can affect almost any organ or tissue. Fibrosis is characterized by the 
formation of excess connective tissue that damages the structure and function of 
the underlying organ or tissue and can lead to a wide variety of diseases. Fibro- 
sis can result either from injury to tissue, in which case it manifests as scarring, 
or from abnormal connective tissue turnover. 

Forty-five percent of all deaths in the developed world are associated with 
chronic fibroproliferative diseases [1,2] such as atherosclerosis and alcoholic 
liver disease. The common denominator of fibroproliferative diseases is dysreg- 
ulated tissue remodeling, leading to the excessive and abnormal accumulation of 
extracellular matrix (ECM) components in affected tissues [1,3-7]. This ECM 
has an altered structure and signals abnormally to the cells that are embedded in 
it [1-5]. During fibrosis, the composition of ECM proteins and their interactions 
with each other and with the cells that attach to them are altered [1,7,8]. 

Fibrosis can affect almost any organ or tissue. Fig. | illustrates the major 
fibroproliferative diseases with a significant impact on human health [1,4,7—9]. 
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Heart 


1. Congestive heart failure 
2. Endomyocardial fibrosis 


Eyes 


1. Glaucoma 
2. Diabetic retinopathy 
3. Diabetic macular edema 


4.AMD 
5. Dry eye disease 


3. Myocardial infarction 


Liver 

1. NASH 

2. Alcoholic liver disease 

3. Schistosomiasis 

4. Idiopathic portal hypertension 
5. Congenital hepatic fibrosis 

6. Hepatitis B virus infection 

7. Hepatitis C virus infection 

8. Autoimmune hepatitis 

9. Primary sclerosing cholangitis 
10. Primary biliary cirrhosis 


vr 
7 ue 
2. Diabetic nephropathy 
Inflammatory bowel diseases J Een apait 
4. Lupus nephritis 


5. Transplant nephropathy 


Lung 

1. IPF 

2. Asthma 

3. COPD 

4. ARDS 

5. Pulmonary arterial hypertension 


Skin 

1. Hypertrophic scars 
2. Scleroderma 

3. Eosinophilic fasciitis 
4. Keloids 

5. Dermatomyositis 


latrogenic fibrosis 
1. Surgery 

2. Radiation therapy 

3. Chemotherapy 


Systemic 

1. Systemic sclerosis 

2. Atherosclerosis 

3. IgG4-related disease 

4. Nephrogenic systemic fibrosis 
5. Cystic fibrosis 

6. Chronic graft vs. host disease 


FIGURE1 Examples of fibroproliferative diseases in different organs. AMD, age-related mac- 
ular degeneration; ARDS, acute respiratory distress syndrome; COPD, chronic obstructive pulmo- 
nary disease; IPF, idiopathic pulmonary fibrosis; NASH, nonalcoholic steatohepatitis. Reproduced 
with permission from Karsdal MA, Krarup H, Sand JM, Christensen PB, Gerstoft J, Leeming, DJ, 
et al. Review article: the efficacy of biomarkers in chronic fibroproliferative diseases — early diag- 
nosis and prognosis, with liver fibrosis as an exemplar. Aliment Pharmacol Ther 2014;40:233— 
49 and Karsdal MA, Manon-Jensen T, Genovese F, Kristensen JH, Nielsen MJ, Sand, JM, et al. 
Novel insights into the function and dynamics of extracellular matrix in liver fibrosis. Am J Physiol 
Gastrointest Liver Physiol 2015. 


Fibrotic tissue was has long been considered an inactive scaffold, prevent- 
ing regeneration of the affected organ. However, this perception cannot be 
upheld because fibrosis is neither static nor irreversible, but instead the result 
of a continuous remodeling that makes it susceptible to intervention [1,10,11]. 
The major future challenge in fibrosis will be to halt fibrogenesis and reverse 
advanced fibrosis without affecting tissue homeostasis or interfering with nor- 
mal wound healing. Consequently, our increased understanding of the ECM, 
its dynamics, and the potential of fibrotic microenvironments to reverse holds 
promise for the development of highly specific antifibrotic therapies with mini- 
mal side effects. 

Traditionally, only growth factors, cytokines, hormones, and certain other 
small molecules have been considered as relevant mediators of inter-, para-, 
and intracellular communication and signaling. However, the ECM fulfils direct 
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and indirect paracrine or even endocrine roles. In addition to maintaining the 
structure of tissues, the ECM has properties that directly signal to cells. Even 
conceptually exclusively structural proteins such as fibrillar collagens or pro- 
teoglycans are emerging as specific signaling molecules that affect cell behavior 
and phenotype via cellular ECM receptors. In addition, the ECM can bind sev- 
eralfold to otherwise soluble proteins, growth factors, cytokines, chemokines, 
or enzymes, thereby restricting or regulating their access to cells as well as spe- 
cifically attracting and modulating the cells that produce these factors. More- 
over, specific proteolysis can generate biologically active fragments from the 
ECM, while the parent molecules of the ECM are inactive. The ECM thus can 
control cell phenotype by functioning as a precursor bank of potent signaling 
fragments in addition to having a direct effect on cell phenotype through ECM- 
cell interactions mediated by receptors such as integrins, certain proteoglycans, 
or both [12-14]. 

The aims of this book are to (1) summarize all current data on key structural 
proteins of the ECM (ie, collagens, laminins, and elastin); (2) review how these 
molecules affect pathologies, in part, exemplified by monogenetic disorders; 
(3) describe selected posttranslational modifications (PTMs) of ECM proteins that 
result in altered signaling properties of the original ECM component; (4) discuss 
the novel concept that an increasing number of components of the ECM harbor 
cryptic signaling functions that may be viewed as endocrine functions; and 
(5) highlight how this knowledge can be exploited to modulate fibrotic disease. 


INTRODUCTION TO THE MATRIX-INTERSTITIAL 
AND BASEMENT MEMBRANES 


When tissue is injured, endothelial or epithelial cells on the tissue surface are 
destroyed, exposing the basement membrane to degradation and an influx of 
inflammatory cells and the deeper interstitial membrane to the risk of fibrosis 
(Fig. 2). The main constituents of the basement membrane are type IV colla- 
gen, laminin, and nidogen. Fragments of type IV collagen—tumstatin—have 
been shown to be very antiangiogenic, possibly directing recovery of the epi- 
thelium by allowing horizontal growth over the basement membrane rather than 
uncontrolled vertical growth into the basement and interstitial membranes. In 
the interstitial membrane, other collagens such as type XVIII are present. A 
protease-derived fragment of collagen type XVIII, endostatin [17], is the most 
potent natural anti-angiogenic molecule which has been show to block fibrosis in 
fibrotic models of the liver and lung. Other collagens such has type XV may play 
similar or more tissue-specific roles by releasing active protein fragments (called 
neoepitopes) such as restin, although this needs to be fully investigated [16]. 

During repair of the matrix after epithelial damage, the underlying mem- 
branes are destroyed by proteases, giving rise to new signaling molecules that 
may be both antifibrogenic and antiangiogenic and could potentially have other 
functions that are yet to be discovered. 
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BIOMARKES OF ENDOTHELIAL CELL DAMAGE 
Exposure of the basement and interstitial membrane results in degradation products 


DAMA OE STRE ANE INFLAMMATORY CELLS 


(A) (B1) CELL DEATH (B2) 


Epithelium l O 
l O 
i mE SS = 
= SSCS Basement membrane | ie — 
l Macrophage 
P l | 
PÍ Interstitial membrane | of P-S 
I Myofibroblast 
l 


Epithelial cell 


Fibroblast 


FIBROBLAST ACTIVATION 
(C) tissue DAMAGE AND REPAIR BIOMARKERS! (D1 ) Tissue DAMAGE OF THE DEEP (D2) FIBROSIS 
OF THE BASEMENT MEMBRANE l INTERSTITIAL MEMBRANE 


PROLIFERATION AND ACTIVATION OF FIBROBLASTS, 
MIGRATION OF INFLAMMATORY CELLS 


FIGURE 2 Schematic representation of the generation of biomarkers of endothelial or epi- 
thelial cell damage from the basement and interstitial membranes. (A) Overview of the endo- 
thelial cell layer which has well-organized basement and interstitial membranes below. (B) Cell 
damage results in the death of localized epithelial cells, resulting in creation of myofibroblasts from 
fibroblasts which proliferate and form a matrix (B1), followed by (B2) recruitment of inflammatory 
cells such as macrophages through the damaged epithelium. (C) Continuous cell insult results in 
tissue damage and denudation of the epithelium exposing the basement membrane to degradation 
in which fragments of the basement membrane are released. (D) Deeper tissue damage exposes the 
underlying interstitial membrane to degradation and its fragments are released through the basement 
membrane (D1). Continuous inflammation load and activation of fibroblasts results in overproduc- 


tion of components of the interstitial and basement membranes in an unorganized manner, ie, in 
fibrosis (D2). 


OVERALL STRUCTURE OF COLLAGENS 


Collagens are widely expressed throughout all organs and tissues. They are 
the most abundant proteins in connective tissue. To date, 42 different collagen 
genes coding for 28 different types of collagens have been identified. Fig. 3 
schematically displays the primary structure of the molecules. Collagens are 
trimeric molecules composed of three polypeptide a-chains which contain the 
repeated sequence (G—X—Y)n, X being frequently proline and Y hydroxypro- 
line. These repeats allow the formation of a triple helix which is the characteris- 
tic structural feature of the collagen superfamily. Each member of the collagen 
family contains at least one triple-helical domain (COL) which is secreted and 
deposited into the extracellular matrix (ECM). Most collagens are able to form 
supramolecular aggregates. Besides triple-helical domains, collagens contain 
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non-triple-helical (NC) domains, used as building blocks by other ECM pro- 
teins. The molecular structure and supramolecular assembly of collagens allow 
their division into major subfamilies, depending on the supramolecular struc- 
ture such as depicted in Fig. 4. 


Fibril-forming collagens 
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Fibril-associated collagens with interrupted triple helices 
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FIGURE 3 Schematic primary structure of collagen including depiction of functional 
domains. Reproduced with permission from Ricard-Blum, S. The collagen family. Cold Spring 
Harb Perspect Biol 2011;3:a004978. 
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Network-forming collagens 
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Fibrils 
Hexagonal networks (collagens VIII and X} 


FACITs (collagen IX) 


( E E- => Beaded filaments (collagen VI) 


Network (collagen IV) 


: Anchoring fibrils (collagen VII) 
> > @ Non-collagenous domain 
= Triple-helical domain (Gly-X-Y) <TSP© Thrombospondin domain 


FIGURE 4 The supermolecular structure of collagens. Reproduced with permission from Ricard- 
Blum, S. The collagen family. Cold Spring Harb Perspect Biol 2011;3:a004978. 


_ 


. Fibril-forming collagens (I, I, II, V, XI, XXIV, XXVII); 

2. Fibril-associated collagens with interrupted triple helices (FACITs) (IX, 
XII, XIV, XVI, XIX, XX, XXI, XXII). The FACITs do not form fibrils by 
themselves but they are associated with the surface of collagen fibrils. 

3. Network-forming collagens (IV, VII, X) form a pattern in which four mol- 
ecules assemble via their amino-terminal 7S domain to form tetramers while 
two molecules assemble via their carboxy-terminal NC1 domain to form 
NC1 dimers 

4. Membrane collagens (XIII, XVII, XXIII, XXV) 


COLLAGEN SYNTHESIS AND OTHER ESSENTIALS 


Collagens such as type | collagen are synthesized in the endoplasmic reticulum 
where two pro-al chains and one pro-a2 combine to form procollagen. This 
complex process j=is mediated by hydroxylation of prolines and lysines to sta- 
bilize the helix, secretion to the extracellular space, enzymatic removal of the 
N- and C-terminal propeptides, packaging the material into fibrils, and finally 
the formation of inter-molecular cross-links leading to the final and mechani- 
cally competent collagen fibrils. Further details on the molecular aspects of 
this process are outside the scope of this article, but we refer the reader to Refs. 
[19,20]. 

Extensive research has been conducted into the biosynthesis of fibril-form- 
ing collagens that are synthesized as procollagen molecules comprised of an 
amino-terminal propeptide followed by a short, nonhelical, N-telopeptide, a cen- 
tral triple helix, a C-telopeptide and a carboxy-terminal propeptide. Individual 
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procollagen chains are subjected to numerous post-translational modifications. 
The heat shock protein 47 (HSP47) binds to procollagen in the endoplasmic 
reticulum. HSP47 is a specific molecular chaperone of procollagen [21]. The 
stabilization of the procollagen triple helix at body temperature requires the 
binding of more than 20 HSP47 molecules per triple helix [22]. It has been sug- 
gested recently that intracellular Secreted Protein Acidic and Rich in Cysteine 
(SPARC) might be a collagen chaperone because it binds to the triple-helical 
domain of procollagens and its absence leads to defects in collagen deposition 
in tissues [23]. Both propeptides of procollagens are cleaved during the matura- 
tion process. The N-propeptide is cleaved by procollagen N-proteinases belong- 
ing to the A Disintegrin And Metalloproteinase with Thrombospondin Motifs 
(ADAMTS) family, except the N-propeptide of the pro-al(V) chain that is 
cleaved by the procollagen C-proteinase, which is also termed Bone Morphoge- 
netic Protein-1 (BMP-1) [24]. BMP-1 cleaves the carboxy-terminal propeptide 
of procollagens except the carboxy-terminal propeptide of the pro-al(V) chain, 
which is processed by furin. The telopeptides contain the sites where cross- 
linking occurs. This process is initiated by the oxidative deamination of lysyl 
and hydroxylsyl residues catalyzed by the enzymes of the lysyl oxidase family. 

The top part of the figure (above the cell membrane) (Fig. 5) illustrates the 
intracellular events and the bottom part of the figure (below the cell membrane) 
illustrates the extracellular events. During the synthesis of pro-a chains in the 
endoplasmatic reticulum specific peptidyl lysine residues are hydroxylated to 
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FIGURE5 Biosynthesis of collagen. Reproduced with permission from Yamauchi M, Sricholpech 
M. Lysine post-translational modifications of collagen. Essays Biochem 2012;52:113-33. 
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form hydroxylysine (—OH—NH,) and, subsequently, specific glycosylated 
hydroxylysine residues, this latter step being called O-linked glycosylation. 
For the latter, either single galactose (a red hexagon) or glucose-galactose (two 
red hexagons) are attached. After these and other modifications (for example 
hydroxylation of proline, or asparagine-linked glycosylation shown as closed 
circles in the C-propeptide), two pro-al chains (solid line) and one pro-a2 
chain (dotted line) associate with one another and fold into a triple helical mol- 
ecule from the C- to the N-terminus to form a procollagen molecule, packaged 
and secreted into the extracellular space. Then both N- and C-propeptides are 
cleaved to release a collagen molecule. The collagen molecules are then spon- 
taneously self-assembled into a fibril and stabilized by covalent intra- and inter- 
molecular covalent cross-linking. During fibrillogenesis, molecules are packed 
in parallel and are longitudinally staggered by an axial repeat distance, D period 
(~67 nm) creating two repeated regions, that is, the overlap and hole regions, in 
the fibril. 


COLLAGEN TURNOVER AS FUNCTION OF AGE 


Collagen turnover is highly affected by age, which is important for designing 
and interpretation of experimental settings. In fact, type I and II collagen are 
more than 100 fold higher in 1 months young animals as compared to 6 months 
old animals [26]. As illustrated in Fig. 6 (reprinted with permission from Ref. 
[26]), collagen turnover is drastically different in animals undergoing remodel- 
ing (rebuilding of tissues) in face of the modeling period, during building of 
tissues. 
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FIGURE 6 Scheme of age-dependent ECM turnover and serum biomarker development at 
different age and subsequent to pathological remodeling/fibrosis. Reproduced with permission 
from Karsdal MA, Genovese F, Madsen EA, Manon-Jensen T, Schuppan D. Collagen and tissue 
turnover as a function of age: implications for fibrosis. J Hepatol 2015. 
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This consequents in three important observations (1) the matrix composition 
and quality may be different in older versus younger animals (2) the relative 
induction of a response to an insult and pathology in older versus younger ani- 
mals is higher, which is important when interpreting biomarkers as this provides 
better contrast as a smaller induction will not be detectable at high expression 
levels in young animals, albeit highly detectable in low turnover situations. This 
has been reported for many collage type I and II markers such as CTX-I and 
CTX-II [27-32]. (3) Much fibrosis research and tissue turnover research, have 
been conducted in younger animals which has a higher capacity for repair and 
turnover, which may have resulted in both false negative but also false positive 
observations [16,26]. 

Much of this regulation of these collagens is consequent to the closure of the 
growth plate, but also other collagens are affected 1- to 4-fold. In contrast, the 
interstitial type III collagen is upregulated by 1- to 2-fold [26], and basement 
membrane is upregulated 3-fold [26], as evident by biomarkers of type IV col- 
lagen formation and degradation. Both Type III and IV collagen stabilizes after 
1-2 months after birth in rats. Other collagens such as type V and VI are not 
regulated [16,26,33]. Carefully designed experiments and biomarkers are lack- 
ing to provide data on the remaining of the 28 collagens. 

In man, there is a strong age dependency on collagens, which however only 
has been carefully investigated with respect to type I and II collagen. Type I 
and II collagen, levels of at the age of 25, and increases after menopause, age 
55+5 years, consequent to the loss of sex hormones by 100-150% [32,34], 
which correspond to the loss in bone mineral density [31,35—39]. This is well 
documented in bone biology, where PINP, CTX-I have been used as biomarkers 
for decades [28,30,40-44]. 


WHY LAMININS? 


Laminins as collagens are structural proteins with helical regions, albeit not a 
stringent as seen in the triple helical region of collagens. In collagens the helical 
domain consists of a strict building block which consists of three amino acids 
Gly-X-Y, where X and Y are often proline (Pro) and hydroxyproline (Hyp), 
respectively. In contrast, the triple helical structure found in laminins is made of 
heptads which has a less strict organization. The heptad structure represented as 
(abcdefg),, often has hydrophobic residues at positions a and d [45]. An example 
of the lower level of strictness in the heptad structure is seen when comparing 
the coiled-coil regions between species. Comparison of the laminin a5 chain 
from mammals with that of insects (that is, drosophila melanogaster) revealed 
that there was only 29% identity whereas the other domains had up to 60% 
identity [46]. It thus seems that the sequence motif does not rely on specific 
residues, but rather depends on the polarity. 

Compared to collagens that are present in all compartments of the body, 
laminins are solely found in the basement membrane. The basement membrane 
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is an intricate meshwork composed of laminins, collagen IV, nidogens, and 
sulfated proteoglycans which separates the epithelium, mesothelium and endo- 
thelium from connective tissue [47,48]. Even though the basement membrane 
consists of the same proteins throughout the body, different isoforms of these 
combine to form structurally and functionally diverse basement membranes. 
During the maturation of most basement membranes the composition of lami- 
nins changes. For example, as part of maturation of the glomerulus, laminin-1 
is present in the early stages but is gradually replaced by laminin-10 and -11. In 
the final stages of maturation laminin-10 disappears, leaving laminin-11 as the 
sole laminin in the glomerular basement membrane (GBM) [49]. 

The crucial role of laminins in the basement membrane is seen during devel- 
opment of mice embryos where it is sufficient for the formation of basement 
membrane-like structures even in the absence of the other major basement 
membrane protein, type IV collagen [50]. Furthermore, except for the laminin 
a4, -B2 and -y3 chains, deficiency of either of the laminin chains leads to early 
lethality [51-73]. Even for those that do not cause early lethality other major 
complications arise such as decreased microvessel growth and complete amy- 
elination of nerves for Lama4-deficiency [74-77]. 

Laminins carry out a central role in organizing the intricate meshwork of 
the basement membranes. This is seen through the wide range of interaction 
partners which include dystroglycan [78-80], nidogens [81-83], syndecans 
[84-86], integrins [87—89], heparin [78,84,90], sulfatides [91] and more. The 
wide range of interactions combined with early lethality seen with deficiency 
of most laminin chains underlines the necessity of laminin presence in the 
basement membrane. In general, the essential role of laminins for mainte- 
nance of the basement membranes is exemplified by the lethality of most null 
mutations. 
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SUMMARY 


Type I collagen is a fibrillar type collagen, and most likely the best investigated 
collagen. Type I collagen is the most abundant collagen and is the key structural composi- 
tion of several tissues. It is expressed in almost all connective tissues and the predomi- 
nant component of the interstitial membrane. Type I collagen mutations have documented 
important roles in a range of diseases, with particular focus on bone and connective tissue 
disease, in particular osteogenesis imperfecta and Ehlers—Danlos syndrome. Type I col- 
lagen is predominantly modified at the posttranslational level, with several crosslinks and 
other modifications. Several biomarkers of type I collagen have been developed, of both 
type I collagen degradation and formation, as surrogate makers of bone degradation and 
formation, respectively. Type I collagen formation is also associated with fibrosis, and 
fibrogenesis. 


Type I collagen is the most abundant type of collagen and is expressed in almost 
all connective tissues. It is the major protein in bone, skin, tendon, ligament, 
sclera, cornea, and blood vessels. Type I collagen comprises approximately 95% 
of the entire collagen content of bone and about 80% of the total proteins present 
in bone [1]. 

Type I collagen is a heterotrimer molecule. In most cases it is composed of two 
al chains and one a2 chain, although an al homotrimer exists as a minor form. 
Each chain consists of more than 1000 amino acids, and the length of a collagen 
type I molecule is ~300nm and the width about 1—Snm. It has three domains: 
an N-terminal non-triple helical domain (N-telopeptide), a central triple helical 
domain, and a C-terminal non-triple helical domain (C-telopeptide), of which the 
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central domain is by far the largest, comprising approximately 95% of the total mol- 
ecule [2]. The triple helical domain is only possible due to the presence of glycine 
(G)-X-Y repeats, where X often is a proline and Y is a hydroxyproline. Glycine at 
every third position is essential for the correct formation of the structure [3]. 

Type I collagen is predominantly modified at the posttranslational level. Some 
of these posttranslational modifications (PTMs) are generated during the synthesis 
of the collagen fibrils and are essential for the mechanical competence of these 
fibrils [2]. Other PTMs processes, such as isomerization, racemization, enzymatic 
cleavage, and glycations, arise as a function of biological changes, and the end prod- 
ucts are proving highly relevant as biomarkers of different disease aspects [3,4]. 

PTMs accumulating during synthesis of the collagen fibrils have been exten- 
sively reviewed [2,5]. Proline residues at the third position of the G-X-Y repeats 
are hydroxylated in most of the chains, whereas in some cases proline resi- 
dues at the second position are also hydroxylated and these hydroxylations have 
proven essential for the stability of the helix [6]. 

A critical factor in the synthesis of a biomechanically competent type I colla- 
gen fibril is the hydroxylation of lysine residues, of which there are 38 in the a1 
chain and 31 in the a2 chain [2]. The extent of hydroxylation of these residues is 
highly tissue dependent and ranges from 15% to 90%, with the level reflecting 
both tissue function and in some cases pathological changes [7]. Upon secre- 
tion of the triple helices, the N- and C-terminal peptides and propeptides are 
removed by enzymatic cleavage involving matrix metalloproteinases (MMPs) 
[8]. After cleavage, lysines and hydroxylysines in some positions, including the 
C terminus, are crosslinked between the individual helices, leading to the for- 
mation of extensive covalent intra- and intermolecular crosslinks, which are 
crucial for the mechanical competence of the collagen fibrils [9]. 

In addition to these PTMs, some of the hydroxylysines in type I collagen 
are glycosylated, although the exact extent of glycosylation is highly variable, 
being affected by multiple aspects such as tissue type and pathologies [10-16]. 
Finally, PTM processes arising spontaneously by aging, disease, or both have 
been described and include racemization or isomerization of aspartate and 
asparagine residues as well as nonenzymatic glycations, leading to the forma- 
tion of advanced glycation end products (AGEs) [3]. In type I collagen, accu- 
mulation of AGEs as a function of age, elevated blood glucose levels, or both 
has been described. The most understood of these AGEs is pentosidine. Pento- 
sidine is formed by condensation of glucose and the available amino groups of 
either arginine or lysine, which leads to the formation of a very stable crosslink 
between collagen chains [17]. Addition of AGE crosslinks to collagen fibers 
alters their mechanical properties. Studies have shown increased stiffness of tis- 
sues containing AGE-modified type I collagen fibers [18-20], hence illustrating 
the detrimental nature of these AGEs. 

The naturally occurring PTMs caused by isomerization or racemization of 
aspartate or asparagine residues from a- to B-forms [21] have been studied in bone 
turnover. In particular, the B-form of aspartate located in the C-telopeptide has been 
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shown to clearly reflect bone turnover [4,22]. Molecules suppressing bone turnover 
lead to accumulation of this PTM, whereas pathologies accelerating bone turnover 
reduce the level of the B-form and increase the level of the a-form [4]. Further stud- 
ies show that accumulation of the -form results in more rigid collagen structure 
and hence introduces stiffness in the bone. It has been speculated this stiffness is 
detrimental to the mechanical properties of the bone matrix [4]. 

In summary, type I collagen contains numerous PTMs, most of which are 
essential for the functionality of the fibrils. However, aging and some patholo- 
gies can introduce PTMs that are detrimental to collagens and could potentially 
serve as biomarkers of disease. 


BIOMARKERS OF TYPE I COLLAGEN 


Due to the high level of type I collagen in different tissues, particularly bone, 
and the well-documented turnover of type I collagen in various tissues [1], 
attempts have been made to measure type I collagen species in blood samples. 

For 20-25 years bone mineral density (BMD) measurements and fracture 
rates were the tools available to detect osteoporosis and monitor treatment 
responses and overall bone health. However, changes in BMD are small and 
fractures are rare, thereby rendering assessment of new osteoporosis drugs can- 
didates extremely slow and expensive [23]. 

In the mid-1990s, pioneers in the bone field working on collagen type I 
fragments demonstrated that these fragments provided insight into bone turn- 
over rates, while reflecting either bone resorption or bone formation [24-26]. A 
truly revolutionary finding was that the presence of collagen type 1 fragments 
very soon after patients initiated treatment with antiresorptive drugs indicated 
their response to these medications, and importantly was shown to predict BMD 
changes over multiple years [25,27]. 

The implementation of the below-described bone turnover markers altered 
the bone field dramatically and partially as a result, osteoporosis is now a dis- 
ease with multiple choices for treatment. This landmark example of how extra- 
cellular matrix turnover markers, in this case type I collagen turnover, led at 
least partially to additional treatments for osteoporosis, underscores the poten- 
tial clinical value of yet other extracellular matrix turnover markers for different 
diseases. 

The biomarkers related to type I collagen fall in two categories: synthesis 
biomarkers, such as amino-terminal propeptide of procollagen type I (PINP) 
and carboxy-terminal propeptide of procollagen type I (PICP); and degradation 
biomarkers, such as C-terminal telopeptide of type I collagen (CTX-D, type I 
collagen—derived crosslinked carboxy-terminal telopeptide; (ICTP), and type I 
collagen neoepitope (C1M) [28,29]. 

The collagen synthesis markers are the propeptides of type I collagen, which 
as described earlier are released during synthesis of the collagen molecule. 
Several studies have shown that these two markers reflect synthesis of bone 
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matrix to a large extent. PINP is recommended by the International Osteopo- 
rosis Foundation (IOF) and the International Federation for Clinical Chemistry 
and Laboratory Medicine (IFCC) to be used in clinical studies of drugs affecting 
bone turnover, due to the large number of studies confirming and validating its 
relevance [29,30]. PINP has been shown to be elevated 100-150% during treat- 
ment with bone anabolic drugs. In these patients, bone formation was confirmed 
by histomorphometry. However, the increase in PINP is detectable markedly 
faster [30] than other bone formation parameters; hence, it is used in most trials 
of drugs affecting bone turnover [28,29]. An interesting study in rats indicated 
that PINP does not solely arise from bone; it can also be released during fibrotic 
changes in the liver [31]. This finding correlates well with the widespread 
expression pattern of type I collagen and underscores the need to know which 
tissues are affected by a given drug to fully understand the PINP response [28]. 

PICP is also used in clinical trials of bone formation drugs where its levels 
also rise with bone formation. However, its accuracy has not been studied to the 
same extent as that of PINP [29]. All these different biomarkers of collagen type 
1 are visualized in Fig. 1.1. 

Type I collagen degradation biomarkers have been studied for around 
20 years due to their relationship to bone loss in osteoporosis [28]. Two types 
of type I collagen degradation markers exist: (1) cathepsin K—generated type 
I collagen fragments [CTX and collagen type 1 crosslinked N-telopeptide 
(NTX)] and (2) MMP-generated type I collagen fragments (C1M and ICTP). 
These markers reflect very different clinical aspects [28,32]. The cathepsin 
K-generated fragments reflect bone resorption and have been used extensively 
in clinical studies of antiresorptive drugs for osteoporosis [28]. The rationale 
for this resides within the highly specific combination of the enzyme cathepsin 
K, which is highly expressed by bone-resorbing osteoclasts, and type I col- 
lagen, which is present in massive quantities in bone [4,23,33]. Studies have 
carefully documented that CTX and NTX are generated directly by osteoclasts 
during bone resorption and that their levels reflect bone resorption in all systems 
from in vitro through in vivo to clinical studies (Fig. 1.2A—F) [28,34-36]. CTX 
has been studied extensively from applications in drug development programs 
in vitro, where it clearly is reduced as a function of antiresorptive compounds, 
and importantly where it correlates strongly with the presence of resorption 
pits on the actual bone slices (Fig. 1.2A—C) [34,37,38]. In addition, studies in 
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FIGURE 1.1 Schematic illustration of the localization of type I collagen neoepitopes. 
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FIGURE 1.2 (A) Illustration of C-terminal telopeptide of type I collagen (CTX-I) being released from an actively resorbing osteoclast (highlighted in black 
outline). (B) Pictures of resorption pits taken in the presence or absence of a chloride channel inhibitor at 90 uM, clearly showing the absence of pits when resorption 
is blocked. (C) Plots of CTX vs. pit area scored histologically in an experiment testing a dose response of a chloride channel inhibitor. (D) CTX-I levels in serum 
from the aged ovariectomized rat model over time. OVX +V indicates ovariectomy with vehicle, OVX +E indicates estrogen treatment, and sham indicates the control 
condition. The data illustrate the OVX-induced increase in CTX-I and also show that estrogen potently reduces CTX. (E and F) Data from a clinical study of alendro- 
nate showing the dose-dependent effects on CTX-I, which are present after 1-3 months and which are predictive of bone mineral density changes over 24 months (F). 
The figures are modified with permission from Karsdal MA, Henriksen K, Leeming DJ, Mitchell P, Duffin K, Barascuk N, et al. Biochemical markers and the FDA 
Critical Path: how biomarkers may contribute to the understanding of pathophysiology and provide unique and necessary tools for drug development. Biomarkers 
2009; 14:181—202; Ravn P, Clemmesen B, Christiansen C. Biochemical markers can predict the response in bone mass during alendronate treatment in early post- 
menopausal women. Alendronate Osteoporosis Prevention Study Group. Bone 1999;24:237-44; Ravn P, Hosking D, Thompson D, Cizza G, Wasnich RD, McClung 
M, et al. Monitoring of alendronate treatment and prediction of effect on bone mass by biochemical markers in the early postmenopausal intervention cohort study. 
J Clin Endocrinol Metab 1999;84:2363-8; Schaller S, Henriksen K, Sveigaard C, Heegaard AM, Helix N, Stahlhut M, et al. The chloride channel inhibitor n53736 
prevents bone resorption in ovariectomized rats without changing bone formation. J Bone Miner Res 2004; 19:1144-53. 
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the ovariectomized rat model of osteoporosis have clearly shown that CTX is 
elevated as a function of ovariectomy, whereas it responds to inhibitors of bone 
resorption such as estrogen, selective estrogen receptor modulators, and alen- 
dronate (Fig. 1.2D) [38a]. In the clinical setting, one of the seminal findings is 
from a study by Ravn et al., who demonstrated that reductions in CTX seen after 
just | month of therapy are predictive of BMD changes occurring over future 
years (Fig. 1.2E-F) [25,27]. 

However, since the fragments are rapidly cleared from the system and since 
bone resorption is affected both by food intake and diurnal variation, it is essen- 
tial that the fragments are measured in samples collected after fasting and in the 
morning [29,39,40]. Samples collected in this way have revealed that CTX-I 
levels, approximately 90% of which are derived directly from bone resorption, 
can with a high sensitivity provide information about the efficacy of antiresorp- 
tive drugs, indicate which patients will experience the fastest bone loss, and 
indicate risk for fractures [28]. Therefore, CTX-I is recommended as the bio- 
marker of choice for bone resorption by the IOF and the IFCC. 

Interestingly, the CTX-epitope contains an isomerization site. Two forms of 
the isomer, an a-form and a $-form, exist [4,22]. Studies have shown that the 
6-form not only accumulates with age but also as a function of agents that dra- 
matically lower bone turnover [41]. In contrast, studies have shown that in condi- 
tions dramatically accelerating bone turnover, such as osteolytic bone metastases, 
the levels of a-CTX were dramatically elevated [42,43]. Data support the a/B ratio 
is an indicator of bone type I collagen age. It has been speculated that this ratio 
can serve as an index of bone quality as a function of antiresorptive therapies [4], 
underscoring the clinical utility of these type I collagen degradation markers. 

The other collagen degradation markers are fragments generated by MMPs, 
and these markers are ICTP and C1M. ICTP has been studied over a long period. 
Although it initially was thought of as a bone resorption marker, it has now been 
clearly demonstrated that it does not derive directly from bone resorption, but 
is only generated by osteoclasts in conditions where the activity of cathepsin K 
is inhibited [44—46]. Under normal conditions, cathepsin K activity destroys the 
ICTP collagen fragment [47]. Other studies, however, have indicated that ICTP 
is elevated as a function of osteolytic tumors and rheumatoid arthritis [32,43]. 

CIM is a novel MMP-generated type I collagen fragment. It is not related to 
bone turnover as it is not released as a function of aggressive bone resorption as 
seen in osteolytic disease [48]. Studies have shown that C1M is closely related 
to chronic inflammation and therefore has potential as a biomarker across mul- 
tiple diseases, including rheumatoid arthritis, osteoarthritis, and various forms 
of fibrosis [48-52]. These data clearly support that MMP-mediated destruction 
of type I collagen is a highly pathologically relevant process and that monitor- 
ing fragments of type I collagen provides clinical value. 

A key aspect of type I collagen, found so abundantly in the body, is its post- 
translational modifications. These modifications are essential for correct syn- 
thesis and structural integrity of collagens, tissue-specific functionality, and 
application as biomarkers of different pathologies. 


Type I Collagen 
Gene name and 
number 

Mutations with 
diseases in humans 


Tissue distribution in 
healthy states 

Tissue distribution in 
pathological affected 
states 

Special domains 


Special neoepitopes 


Protein structure and 
function 


Binding proteins 


Known central function 


Animals models 


Biomarkers 
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Description 

COL1A1, location 17q21.3-q22 
COL1A2, location 7q21.3-22.1 
Osteogenesis imperfecta I-IV 
Ehlers—Danlos 

Caffey disease 

Ubiquitous 


Ubiquitous 


Like other fibrillar collagens it consist 

of three NC domains [1-3] plus two Col 
domains [1,2] 

N- and C-terminal propeptides and 

N- and C-terminal degradation peptides 
Type | collagen is a heterotrimer 
molecule and in most cases is composed 
of two a1 chains and one a2 chain, 
albeit an a1 homotrimer exists as a minor 
form. Each chain consists of more than 
1000 amino acids. Glycines at every 
third position of the helical domain are 
crucial for the helix structure 

Essential component for the mechanical 
competence of the bone extracellular 
matrix, but also a key structural 
component of many other tissues. Full 
function not yet clear 

Integrins, proteoglycans, and many more 
Main organic component of bone, 
indispensable for bone integrity 
COL1A2-deficient mice (oim mice), 
collagenase-resistant collagen | mouse 
a- and B-CTX-I, NTX, ICTP, PINP, PICP, 
CIM 


References 
Gene ID: 1277 
Gene ID: 1278 
53-58] 


273] 


2p 


[6,9,55,59-61] 


[6,9] 
[6,9] 


[2,3] 


[28,29] 


COL, Collagen; CTX-1, C-terminal telopeptide of type | collagen; NTX, collagen type 1 crosslinked 
N-telopeptide; ICTP, type | collagen-derived crosslinked carboxy-terminal telopeptide; P/CP, 
amino-terminal propeptide of procollagen type l; C7M, type | collagen neoepitope. 
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SUMMARY 

Type II collagen is a fibrillar collagen, and the main component of cartilage. Type II col- 
lagen is the cartilage collagen; it constitutes 95% of the collagens and approximately 60% 
of dry weight. Mutations in type H collagen result in several types of chondrodysplasia, 
leading to premature osteoarthritis. Type H collagen is typically coassembled with collagen 
XI, where it is covalently crosslinked to collagen IX and interacts with small leucine-rich 
proteoglycans. Its stability and strength provide the tissue with integrity and resiliency to 
stress. Type II collagen cleavage is primarily mediated by collagenases of the matrix metal- 
loproteinase family, resulting in well-described biomarkers such as C-terminal telopeptide 
of type II collagen and C2C. In addition, several formation makers have been developed. 
Type II collagen has important binding partners such as fibronectin and other collagens. 


Type II collagen is transcribed from the gene Col2A1, and it is mainly expressed 
in the extracellular matrix (ECM) of articular cartilage, in the intervertebral discs, 
and to a lesser extent in the vitreous humor of the eyes. Type II collagen belongs 
to the fibril-forming collagens and is composed of three identical a1(II) chains 
[a1 (D];. Like other fibril-forming collagens, it consists of a right-handed triple 
helix of about 300nm corresponding to about 1000 amino acids, and the triple 
helical (Gly-X-Y) repeat is the predominant motif [1]. This collagen molecule 
is about 1.5nm in diameter [2]. Within collagen fibrils, the molecules are stag- 
gered in an N-to-C pattern. Like other fibril-forming collagens, type II collagen 
is synthesized in a procollagen form of which the helical domain of the al(D 
chain is the major part of the molecule and is termed Coll. As the collagen is 
formed, the procollagen domains of each end are cleaved off by C-proteinase 
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and N-proteinase, respectively [3]. The C-propeptide is located at the C terminus 
and consists of a noncollagenous (NC)1 domain [4] comprised of three identical 
35-kD chains, which are covalently bonded [5]. The N-propeptide is located at 
the N terminus. It can be subdivided into the three domains: NC2, Col2, and NC3 
[4]. There are two splice variants of the N-propeptide of which type II collagen- 
derived N-terminal propeptide (PIIBNP) is usually the only variant expressed 
in healthy adults [6]. The other splice variant is known as type IIA procollagen 
amino terminal propeptide (PIIANP), and it is characterized by a prolongation 
of 69 amino acids in a cysteine-rich globular domain of the PIIBNP sequence. 
PIIANP expression is usually restricted to embryogenesis, but it is reexpressed 
in osteoarthritis [7]. An interesting feature of the PIIBNP fragment (but not the 
PITANP variant) is that it apparently inhibits osteoclasts survival as well as bone 
resorption [8]. Another study suggests that PIIBNP induces cell death in tumor 
cells via interaction with integrins [9]. 

Type II collagen is typically coassembled with type XI collagen. It is cova- 
lently crosslinked to type IX collagen and interacts with small leucine-rich pro- 
teoglycans that influence collagen fibrillar architecture and function [10]. Its 
stability and strength provide the tissue with integrity and resiliency to stress 
[1]. It is the main collagen expressed in cartilage, where it constitutes 95% of 
all collagens [11] and accounts for 60% of the dry weight of adult cartilage [12]. 
Damage to the fibrillar meshwork may be a critical event in the pathology of 
arthritis, in part, due to low collagen turnover within the cartilage, which makes 
it difficult to regenerate [13]. 

When Col2A1 is inactivated in breeding transgenic mice, heterozygous off- 
spring have a mild phenotype, whereas the monozygous mice die just before or 
shortly after birth. The cartilage of the monozygous mice is characterized by highly 
disorganized chondrocytes with a complete lack of extracellular fibrils discernible 
by electron microscopy. The mice lack both endochondrial bone and the epiphy- 
seal growth plate in long bones. Interestingly, many other skeletal structures, such 
as the cranium and ribs, are normally developed and mineralized. These findings 
demonstrate that a well-organized cartilage matrix is required as a primary tissue 
for the development of some components of the vertebrate skeleton, but it is not 
essential for others, such as the cranial bones [12]. In animal models of autoim- 
munity, type II collagen has been proven to be very efficient as an immunogen in 
mice [14] and rats [15] in collagen-induced arthritis (CIA). CIA has been studied 
extensively because of its similarities to rheumatoid arthritis [14]. 

More than 30 different mutations have been identified in the type II col- 
lagen gene locus, all of which are heterozygous mutations mapping the triple 
helical domain of the molecule. The general clinical phenotype is identified 
as spondyloepiphyseal dysplasia. The severity ranges from very mild types of 
precocious osteoarthritis to the moderately severe disorder Kniest dysplasia to 
very severe achondrogenesis type II and hypochondrogenesis. Mutations, in 
general, lead to short stature as in Kniest dysplasia or to degenerative arthritis 
as in Stickler dysplasia as well as eye and inner ear abnormalities [16]. 
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Type II collagen cleavage is primarily mediated by collagenases of the 
matrix metalloproteinase (MMP) family. Three of the known collagenases 
are MMP-1 (collagenase 1), MMP-8 (collagenase 2), and MMP-13 (collage- 
nase 3), which cleave type II collagen between residues 775 (glycine) and 776 
(leucine) [17]. Another cleavage site for MMP-1 is between residues 906 and 
907 [18]. MMP-13 is thought to be the key enzyme involved in the excessive 
cleavage in osteoarthritis of interleukin (IL)-1—stimulated cartilage destruction 
[19]. MMP-9 (gelatinase B) is also able to cleave type II collagen and generate 
several cleavage fragments [20]. A well-characterized neoepitope is CB12-II, 
which has been found to induce MMP-13 in chondrocytes through the nuclear 
factor-kB pathway [21]. CB12-II was originally derived from a cyanide bro- 
mide—generated peptide CB12, which corresponds to the 195-218 residues of 
bovine type II collagen [22]. 


BIOMARKERS OF TYPE II COLLAGEN 


As the main component of cartilage, evaluation of type II collagen degradation 
and formation has for several years been applied as a valuable tool for estimat- 
ing cartilage turnover [23]. Several candidate neoepitopes have been selected 
for this purpose, including the COL2-3/4m antibody, which recognizes the 
three-quarter piece of MMP-1-cleaved triple helix of collagen II between resi- 
dues 906 and 907 [18]. Several other neoepitopes have been identified and dif- 
ferent assays have been established to measure these fragments as an estimate of 
cartilage degradation. One of the fragments is the C1,2C (COL2-3/4Cshort) rec- 
ognizing the CGPOPQG (where O is hydroxyproline) sequence corresponding 
to the C terminus of the three-quarter length of the triple helical molecule. The 
levels of C1,2C are found to be significantly higher in a-chymotrypsin extracts 
of human osteoarthritic cartilage compared with extracts from nonarthritic 
patients. One weakness of the assay detecting this fragment was cross-reactivity 
toward peptides of types I and II collagen fragments [24]. The assay Col2- 
3/4m recognizes the sequence AOQGEAGROGPOGP, a fragment of the CB12-II 
sequence [18]. The C2C (Col2-3/4C jong mono) assay recognizes the CGPPGPQG 
fragment generated by intrahelical cleavage of collagenases. This sequence is 
located at the carboxyl terminus of the three-quarter piece of the degraded a1 (II) 
chain. The assay recognizes type II collagen cleaved by either MMP-1 or MMP- 
13. This cleaved collagen has been found to be significantly higher in the urine 
and serum of patients with rheumatoid arthritis compared with controls [25]. 
Another assay, type II collagen neoepitope (TIINE), uses the antibodies 9A4 and 
5109 to distinguish between osteoarthritis and healthy controls [26]. Whereas 
9A4 is specific for the C-terminal collagenase—generated neoepitope 776GPP- 
GPQG794 [27], 5109 recognizes the sequence GEPGDDGPS upstream from 
the sequence recognized by 9A4. Both C2C and 9A4 antibodies have the disad- 
vantage that their reactivity toward the neoepitope is affected by the hydroxyl- 
ation of Pro771 of the collagen [25,28]. Levels of Pro771 hydroxylations seem 
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to increase when type II collagen is pathologically affected [29,30]. A fragment 
from the N-terminal of type II collagen, GPPGPQG (which is also recognized 
by 9A4), is detected by the type II collagen neoepitope (CIINE) antibody when 
applied together with the 6G4 antibody raised against the CGEPGDDGPS in 
a sandwich enzyme-linked immunosorbent assay. This assay measures type II 
collagen fragments containing the selected neoepitope regardless of hydroxyl- 
ation of Pro771 or fragment length and with minimal cross-reactivity toward 
types I and HI collagen. CIINE has measured significantly elevated levels of 
collagen type II fragments in the urine of osteoarthritis patients compared with 
healthy controls [30]. The CartiLaps assay specifically measures cartilage deg- 
radation by quantifying C-terminal telopeptide of type II collagen (CTX-ID) at 
the six-amino acid sequence EKGPDP, derived from the type II collagen C-ter- 
minal telopeptide. CTX-II concentrations have been found to be significantly 
increased in urinary samples of rheumatoid arthritis and osteoarthritis patients 
compared with healthy controls [31]. C2M evaluates type II collagen degrada- 
tion by recognizing the RDGAAG fragment identified in the C-terminal region 
of the triple helical domain of type II collagen. The sequence was selected from 
mass spectrometry analyses of in vitro MMP-9-cleaved human cartilage [32]. 
This assay has been applied to differentiate between osteoarthritis and healthy 
controls [32] as well as in studies of rheumatoid arthritis [33] and patients with 
ankylosis spondylitis [34]. 
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In addition to evaluating cartilage degradation, it is of interest to detect 
whether new cartilage is being formed; thus, the type II C-terminal propeptide 
(CPI) assay was developed. It evaluates levels of the C-propeptide of type II 
procollagen, based on rabbit antisera raised against isolated C-propeptide of 
bovine type II procollagen. The levels of CPI are significantly elevated in sera 
of rheumatoid arthritis patients and decreased in osteoarthritis compared with 
healthy controls [5]. Subsequently, the type IIA procollagen amino-terminal 
propeptide (PITANP) assay was developed, which is sensitive to the PILANP 
splice variant of the N-terminal procollagen. PIIANP is based on a polyclonal 
antibody in rabbits and raised against the sequence of exon 2, published of type 
IIA procollagen [35,36]. PILANP is usually not expressed in healthy adults, and 
the assay for Pro-C2 was therefore established for evaluation of the PIIBNP 
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splice variant of the N-terminal procollagen with the sequence QDVRQPG [37] 
as well as a procollagen II N-terminal propeptide (PIINP) assay that is specific 
for the sequence GPQGPAGEQGPRGDR and does differentiate between the 
two splice variants [38]. However, so far the Pro-C2 and PIINP assays have only 
been validated in in vitro and ex vivo models. 


Type II Collagen Description References 
Gene name and COL2A1, location 12q13.11 Gene ID: 1280 
number 

Mutations with In utero/congenital lethal: [10] 

diseases in humans e Platyspondyly Torrance type 


e Achondrogenesis type II 
e Hypochondrogenesis 
Neonatal presentation: | 
Hypochondrogenesis (limited survival) 
e Spondyloepiphyseal dysplasia congenital 
e Kniest dysplasia 
e Stickler syndrome (cleft palate) 
e Spondyloperipheral dysplasia 
Infancy or childhood: | 
Spondyloepimetaphyseal dysplasia (Strud- 
wick type) 
e Stickler syndrome 
e Multiple epiphyseal dysplasia 
e Generalized arthritis 
Adult presentation: l 
Avascular necrosis of the femoral head 
e Premature osteoarthritis 
Null mutation in mice Disorganized chondrocytes, complete lack [12] 
of extracellular fibrils, no endochondrial 
bone or epiphyseal growth plate in long 
bones. Cranium and ribs were normal and 
mineralized. 
Tissue distribution in Nuclei pulposi and intraarticular menisci [13,39] 
healthy states 
Tissue distribution in NA 
pathological affected 


states 
Special domains Three NC domains (1-3) +two Col domains [1] 
(1 and 2) 
Special neoepitopes N- and C-terminal propeptides, and N- and [4,5] 
C-terminal 
Protein structure and Homotrimer of three collagen a1 [I]; [1,10,40] 
function chains. The helical region of each a chain is 


composed of 1014 amino acids, with glycine 
occupying every third position, where it is 
critical to the folding and stability of the helix. 
An essential component of the cartilage 
extracellular matrix, present in the 
developing cartilage anlagen and essential for 
endochondral bone formation 
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Type II Collagen Description References 

Binding proteins Anchorin Cll; fibronectin; types IX, XI, Ill, and [2,41-44] 
VI collagen 

Known central Main structural protein of articular cartilage; 13] 

function forms the backbone of the cartilage 
heteropolymeric fibrils 

Animals models Collagen induced arthritis (CIA) mouse and 14,15,45] 
rat models 

Biomarkers Degradation: C2M, CTX-Il, TIINE, CIINE, 24,25,28,30-32] 
C2C, C1,2C 
Formation: CPII, PIIANP, PIINP, Pro-C2 5,35,37,46] 


C1,C2; C2C; C2M; CIA, collagen-induced arthritis; Col, collagen; CPII, type Il C-terminal propeptide; 
CTX-II, C-terminal telopeptide of type Il collagen; NA, not applicable; NC, noncollagenous; PIIANP, 
type IIA procollagen amino-terminal propeptide; PIINP, procollagen II N-terminal propeptide; Pro-C2; 
TIINE and CIINE, type Il collagen neoepitopes. 
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SUMMARY 

Type II collagen is a fibrillar collagen, and it consists of only one collagen a chain, in 
contrast to most other collagens. It is a homotrimer containing three a1(III) chains super- 
coiled around each other in a right-handed triple helix. Type III collagen is secreted by 
fibroblasts and other mesenchymal cell types, thus making it a major player in various 
inflammation-associated pathologies such as lung injury, viral and nonviral liver diseases, 
kidney fibrosis, hernia, and vascular disorders. Type III collagen together with type I 
collagen are the main constituents of the interstitial matrix. Type III collagen mutations 
are associated with Ehlers—Danlos syndrome, vascular deficiency, and aortic and arterial 
aneurysms. Several biomarkers for type III collagen formation and degradation have been 
developed and used extensively. In particular, for fibrosis, type III collagen formations 
have proven valuable. 


Type HI collagen is encoded by the COL3A/ gene located on chromosome 2q31 
[1]. Type MI collagen belongs to the fibrillar collagen superfamily. It is a homotri- 
mer containing three a1(III) chains supercoiled around each other in a right- 
handed triple helix with a characteristic 1000-residue Gly-Xaa- Yaa repeat, where 
Xaa and Yaa are largely represented by the amino acids proline and hydroxypro- 
line, respectively. Glycine faces the center, whereas amino acids with longer side 
chains face outward, allowing close packaging along the central axis in the triple 
helix [2]. Type HI collagen is found extensively in connective tissues such as skin, 
lung, liver, intestine, and the vascular system, and it is often associated with type I 
collagen, except in bone [3-5]. Ultrastructural analysis of tissues from mice with 
mutations in the COL3A/ gene has shown that type III collagen is essential for 
normal type I collagen fibrillogenesis in the cardiovascular system, intestines, and 
skin [6]. 
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In the extracellular matrix (ECM), type II collagen constitutes the major 
part of the interstitial matrix together with type I collagen. During incorpo- 
ration into the ECM, N- and C-terminal propeptides are cleaved off. Cleav- 
age of the C-terminal propeptide from soluble procollagen precursors of the 
fibrillar collagen is mediated by bone morphogenic protein-1 (BMP-1) and 
the tolloid-like proteinases, and it is the rate-limiting step in the control of 
fibril assembly [7]. The processing of type III collagen can be enhanced by 
up to 20-fold by the procollagen C-proteinase enhancers (PCPEs). Evidence 
suggests that PCPE1 binds exclusively to the C-terminal propeptide region of 
the procollagen molecule and remains bound after cleavage by BMP-1. Iden- 
tification of the binding site of PCPE1 would enable the development of new 
strategies aimed at blocking this action and which could be a potential target 
for antifibrotic therapies [8]. 

Type III collagen is secreted by fibroblasts and other mesenchymal cell types, 
thus making it a major player in various inflammation-associated pathologies 
such as lung injury, viral and nonviral liver diseases, kidney fibrosis, hernia, 
and vascular disorders [9—13]. Scar tissue contains types I and III collagen with 
different levels of hydroxylation of lysine and glycosylation of hydroxylysine. 
During the process of wound healing the fibrillar collagens, including type III 
collagen, act as a scaffold for fibroblast attachment. This scaffolding changes 
their composition, ultimately leading to increased scar strength over time. Early 
granulation tissue contains mainly type HI collagen and only a small amount of 
type I collagen. As wound healing continues, this ratio is altered, leading to a 
type I/II ratio of 1:2, and the change in immature type II collagen may result 
in loss of tensile strength. This shift is observed in many conditions such as liver 
cirrhosis, keloids, and hypertrophic scars due to an increased expression of type 
HI procollagen mRNA [14]. 

Fibrillar procollagen C propeptide domains are associated with several 
genetic disorders of connective tissues. Mutations in this domain are particu- 
larly important as they are responsible for directing the assembly of the collagen 
molecule. In general, these mutations either prevent trimerization completely, 
leading to haploinsufficiency of the affected collagen type in heterozygotes, or 
they lead to abnormal procollagen assembly, which involves both the wild type 
and the mutant chain. One disease associated with mutations in this domain 
is Ehlers—Danlos syndrome (EDS) type IV, which leads to vascular deficiency 
[15] and aortic and arterial aneurysms. In this disease, mutations of genes may 
include deletion, missense mutations, and missplicing. The mutations cause dis- 
rupted formation or stability of the helix region, leading to misfolded helices. 
This in turn causes accumulation of defective trimers in the ECM [16]. Mouse 
models of EDS type IV produced by targeted deletion of COL3A/ have been of 
limited use as only 10% of homozygous animals survive to adulthood, whereas 
heterozygous animals die from arterial rupture [6]. A recent study identified a 
spontaneous 185-kDa deletion in the promoter region and exons 1-39 of the 
murine COL3A/ locus. This +/COL3A14 mouse model mimics the vascular 
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aspects of human EDS type IV with aortic dissections, which may result from 
aberrant collagen fibrillogenesis within the aortic wall [17]. 

Type II collagen serves as a ligand for several proteins, including G protein- 
coupled receptor-56 (GPR56), discoidin domain receptors (DDRs) | and 2, von 
Willebrand factor (VWF), and integrin «261. Binding to vWF and integrin 0261 
plays an important function in wound healing processes where platelets adhere to 
the injured vessel wall. Using collagen binding assays in vitro, researchers have 
identified the RGQOGVMGEF sequence (O is hydroxyproline) as the major high- 
affinity vWF-binding site on type II collagen and it is only found in one other 
protein, type II collagen. The sequence is 100% conserved in mouse, rat, and cow. 
However, in human type I collagen, a closely related sequence is found on the al 
chain, differing by a single O-to-A substitution in position 4, suggesting that VWF 
may bind types I, II, and HI collagen in an identical manner [18]. 

Interaction between GPR56 and type III collagen is essential in regulating 
cortical development. The interaction occurs in the developing brain, resulting 
in inhibition of neural migration due to a regulation of the pial basement mem- 
brane integrity and cortical lamination [19]. Type HI collagen serves as a ligand 
for both DDR1 and DDR2 at the GVMGFO motif [20]. Normal interaction 
controls cell proliferation, adhesion, and migration as well as ECM remodeling, 
whereas aberrant interaction is associated with various diseases such as fibrosis, 
atherosclerosis, several types of cancers, and arthritic disorders [21]. 


BIOMARKERS OF TYPE IlI COLLAGEN 


The N-terminal propeptide of type III collagen (PIIINP) is the best-known bio- 
marker of type III collagen synthesis. It has been used as a marker of ongo- 
ing repair processes independent of etiology [22-24]. In liver fibrosis, PHINP 
has shown to have acceptable diagnostic accuracy for liver fibrosis in patients 
with various chronic liver diseases such as alcoholic liver disease and chronic 
hepatitis C infection [25,26]. PIIINP has further been evaluated as a prognos- 
tic marker of liver fibrosis in limited studies. The largest prospective study of 
chronic hepatitis C progression is the HALT-C (Hepatitis C Antiviral Long-term 
Treatment against Cirrhosis) trial in which baseline levels of PIIINP were inde- 
pendently associated with disease progression [27]. 

In kidney fibrosis, two independent studies found PIINP to be highly 
correlated with the extent of interstitial fibrosis [28,29]. Even though PIIINP 
is mainly cleared in the liver and to a lesser extent in the kidney, the urinary 
PIIINP marker is believed to reflect intrarenal synthesis of type HI collagen 
since urinary PIIINP does not correlate with proteinuria [29]. 

Other markers of type II collagen include type III collagen neoepitope 
(C3M) and N-terminal propeptide of type III collagen (Pro-C3), which are novel 
markers based on protein fingerprint technology. The markers are designed to 
detect a specific cleavage fragment generated by disease-specific proteases, 
such as matrix metalloproteinase (MMP)-9 for C3M [30] and N-protease for 
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Pro-C3 [31]. Combining a protein with a protease in the assay ensures that the 
marker only detects the neoepitopes generated from the cleavage and not the 
intact protein. This means that measurements of subpools of the same protein 
can provide different information, emphasizing the importance of distinguish- 
ing one subpool from another [32]. 

Given the example of type III collagen, the PIIINP epitope of the N-terminal 
propeptide can be a marker of both formation and degradation since removal 
of the propeptide sometimes is incomplete leaving the propeptide attached to 
the molecule after it has been incorporated in the ECM [33]. However, tar- 
geting the N-protease cleavage site of the N-terminal propeptide, the Pro-C3 
marker measures true formation of type III collagen [31], whereas targeting the 
MMP-9-mediated fragment, the C3M marker measures degradation of type III 
collagen [30]. 
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The novel two collagen markers have proven useful as diagnostic, prog- 
nostic, and efficacy of intervention tools in various diseases. In chronic kidney 
disease the Pro-C3 and C3M markers can be used as noninvasive diagnostic and 
prognostic tools to monitor ECM remodeling in both experimental and human 
kidney fibrosis as the two markers have been shown to be elevated in advanced 
stages of chronic kidney disease [34,35]. 

In chronic liver disease, Pro-C3 and C3M have been able to detect the degree 
of fibrosis in patients with various underlying etiologies of fibrosis, including 
hepatitis C, alcoholic liver disease, portal hypertension, and human immunodefi- 
ciency virus [36-38]. In one study the markers have even been shown to provide 
different clinical information related to diagnosis and prognosis in patients with 
chronic hepatitis C infection. When stratified according to the fibrosis score, 
baseline serum levels of the two markers had similar diagnostic performances 
as three common biomarkers of liver fibrosis: aspartate aminotransferase, ala- 
nine aminotransferase, and Fibrotest. However, when stratifying the baseline 
marker levels to the changes in fibrosis scores after 52 weeks, Pro-C3 was the 
only marker which significantly differentiated progressors from stable patients. 
Furthermore, patients with high Pro-C3 levels at baseline had >4 times higher 
odds of being progressors of fibrosis than those with low baseline levels [39]. 

In bile duct—ligated rats, treatment with statins was associated with a reduc- 
tion in Pro-C3 levels in severe fibrosis stages [40]. In addition, in an ex vivo rat 
model using fibrotic precision-cut liver slices, the level of C3M was elevated 
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in the supernatant from fibrotic liver culture compared with the control liver 
culture. When adding a phosphodiesterase inhibitor, the level of C3M in the 
fibrotic liver culture was reduced to the same level as the control liver culture 
[41]. These experiments suggest that the markers may serve as a read-out for 
efficacy of interventions to reflect restoration of the collagen synthesis from 
increased levels during fibrosis to a balanced healthy turnover. 

Pathological type III collagen turnover has also been investigated in two types 
of lung diseases, idiopathic pulmonary fibrosis (IPF) and chronic obstructive pul- 
monary disease (COPD), by using Pro-C3 and C3M biomarkers. The first study 
included patients with mild COPD or IPF and healthy controls. Serum levels of 
C3M were significantly increased in mild COPD and IPF compared with the healthy 
control group [42]. This finding was confirmed in two larger studies [43,44]. Find- 
ings of the PROFILE (Prospective Study of Fibrosis In the Lung Endpoints) study, 
which is the largest prospective study investigating progression of IPF, showed that 
C3M was elevated in patients compared with healthy controls in both the discovery 
cohort and the validation cohort. C3M was also able to differentiate patients with 
progressive IPF from stable patients at 6months and high baseline levels were fur- 
ther associated with overall mortality [43]. A study including patients with COPD 
exacerbations showed that during exacerbations the serum level of C3M was ele- 
vated compared with the level at follow-up 4weeks later. Although no difference 
was observed for Pro-C3 during exacerbation compared with follow-up, the degra- 
dation/formation ratio was significantly elevated during exacerbation, suggesting an 
overall type III collagen destruction at these episodes [44]. 

Along with types I and II collagen, type III collagen is a main constituent of 
the joint. Increased levels of type II collagen fragments have provided novel 
insights into the connective tissue balance and possible pain mechanisms in rheu- 
matoid arthritis (RA) and osteoarthritis (OA). The strong correlation between 
C3M and the MMP-generated fragment of C-reactive protein (CRP) observed 
in OA patients suggests that the increased tissue turnover is a result of local 
inflammatory processes in the synovial membrane of the joint [45,46]. Interest- 
ingly, in the Tocilizumab Safety and the Prevention of Structural Joint Damage 
(LITHE) biomarker study including more than 700 patients with moderately to 
severely active RA, C3M serum levels were reduced by ~70% in patients treated 
with 8 mg/kg tocilizumab after 52 weeks. Furthermore, a significant difference 
in circulating C3M levels was observed between early nonresponders, nonre- 
sponders, and responders, suggesting that specific responder profiles exist [47]. 

Together, these studies show that regardless of whether the disease is associ- 
ated with inflammation or fibrosis, the serum concentrations of protein frag- 
ments generated by specific proteases are increased in patients compared with 
healthy individuals. Furthermore, elevated levels of either one of the markers or 
both were associated with disease progression. This suggests that the pathologi- 
cal ECM remodeling can be monitored in different stages, from early disease 
onset with inflammation to the more advanced disease stages where fibrosis is 
present and that ECM remodeling is a common denominator in many diseases. 
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Type III Collagen Description Reference 
Gene name and number COL3A1, chromosome 2q31 [1] 
Mutations with diseases  Single-base mutations cause amino acid [15,48,49] 
in humans substitutions, RNA splicing mutations, 
and partial gene deletions, causing 
vascular EDS (moderate-to-severe form). 
Homozygosity for the null allele causes 
recessive EDS. 
Null mutation in mice N.A. 
Tissue distribution in Skin, vessel walls, lung, liver, and [3-5] 
healthy states spleen, among others, along with type | 
collagen (except in bone) 
Tissue distribution in Connective tissue fiber hyperplasia, [9-12,50,51] 
pathologically affected upregulated in fibrotic tissue (lung, liver, 
states kidney, vascular system) 
Special domains vWF type C domain, fibrillar collagen [52] 
C-terminal domain 
Special neoepitopes N- and C-terminal propeptides [53] 


Protein structure and 
function 


Binding proteins 


Homotrimer consisting of three a1 (III) 
chains flanked by N- and C-terminal 
propeptides 

GPCR56, PCPE-1, DDR1, DDR2, vWF, 
integrin @2B1, LAIR-1, PDGF 


[3] 


[8,19,20,54,55] 


Known central function — Cell-matrix adhesion, collagen fibril [18,56-58] 
organization, ECM organization, 
response to TGF-f1 
Animals models with +/COI3A1 knockout [17] 
protein affected 
Biomarkers PIIINP, Pro-C3, C3M [31,59,60] 


EDS, Ehlers-Danlos syndrome; N.A., not applicable; vWF, von Willebrand factor; CPCR, G pro- 


tein-coupled receptor; PCPE, procollagen C-proteinase enhancer; DDR, discoidin domain receptor; 
LAIR, leukocyte-associated immunoglobulin-like receptor; PDGF, platelet-derived growth factor; 
ECM, extracellular matrix; TGF, transforming growth factor; P/IINP, N-terminal propeptide of type 
Ill collagen; Pro-C3, N-terminal propeptide of type III collagen; C3M, type Ill collagen neoepitope. 
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SUMMARY 

Type IV collagen is the main collagen component of the basement membrane. It is a 
network-forming collagen that underlies epithelial and endothelial cells and functions as 
a barrier between tissue compartments. Type IV collagen has many binding partners and 
forms the backbone of the basement membrane. It holds important signaling potential 
as subdomains such as tumstatin are released when the protein is degraded by special 
proteases. Consequently, type IV collagen is both the most important structural collagen 
of the basement membrane and it entails key signaling potential, which is important for 
various physiological and pathological functions. The most well-studied mutations in type 
IV collagen cause Alport syndrome, a chronic kidney disease. Several biomarkers of type 
IV collagen have been developed, both formation and degradation fragments as well as 
whole domains such as 7S, documenting the importance of type IV collagen turnover in 
most, if not all, connective tissue diseases. 


Type IV collagen is a crucial structural component of the basement mem- 
brane [1], a specialized sheet-like extracellular matrix (ECM) that underlies 
epithelial and endothelial cells and functions as a barrier between tissue com- 
partments. Type IV collagen is a network-forming collagen that provides a 
molecular scaffold and interacts with cells, growth factors, and other base- 
ment membrane components such as laminin, nidogen, and perlecan [2-7]. 
The specific interactions result in a specialized basement membrane, unique 
for each tissue, which is involved in several important biological processes 
including cell adhesion, migration, development, tissue regeneration, and 
wound healing [8]. Six different a chains of type IV collagen, al—a6, are 
expressed in vertebrates [9]. They are encoded by the genes COL4A1 to 
COL4A6, which are organized into pairs in a head-to-head manner [9-12]. 
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Each a chain consists of a short N-terminal collagenous 7S domain, a central 
collagenous domain of approximately 1400 residues, and a C-terminal globu- 
lar non-collagenous (NC) | domain of approximately 230 residues [8]. The 
collagenous domain of type IV collagen differs from that found in the fibrillar 
collagens as it contains 21—26 characteristic interruptions of the proline-rich 
Gly-Xaa-Yaa triple repeat [8]. These frequent interruptions result in a less 
rigid structure and provide type IV collagen with extensive flexibility that 
enables the formation of a network and may allow for specific proteolytic 
cleavages of the triple helix or serve as sites of cell binding and interchain 
crosslinking [13]. Three a chains self-assemble into a heterotrimeric protomer 
by the association of their NC1 domains and the subsequent supercoiling of 
the triple helical domains in a zipper-like manner. The association of the NC1 
domains is stabilized by domain-swapping interactions [14], and the presence 
of recognition sequences ensures the formation of only three different pro- 
tomer isoforms: alala2(IV), a3a4a5(IV), and aS5a5a6(IV) [15]. After secre- 
tion to the ECM, the protomers self-assemble into complex lattice-shaped 
networks. The NC1 domain of two protomers associate end-to-end, forming 
an NCI hexamer, and the 7S domains of four protomers form a 7S tetramer 
[16,17]. The 7S domain is rich in cysteine and lysine residues that stabilize the 
7S tetramer by forming disulfide bonds and lysine—hydroxylysine crosslinks. 
Furthermore, the 7S tetramer is heavily glycosylated, making it resistant to 
collagenase activity [18]. The NC1 hexamer, although containing conserved 
cysteine residues, has been shown to be stabilized by covalent Met—Lys cross- 
links with only intrachain disulfide bonds [14,19,20]. 

The type IV collagen networks consist of alala2(IV), a3a4a5(I[V), or a 
combination of alala2(IV) and aS5a5a6([V) connected through their NC1 
domains [21]. The predominant network alala2(IV) is found in basement 
membranes of all tissues, whereas the other networks show restricted tissue 
distribution. «304a5(IV) has been identified in the glomerular basement mem- 
brane of the kidney and in the alveolar basement membrane of the lung as 
well as in basement membranes of testis, inner ear cochlea, and eye, whereas 
a5a5a6(IV) is expressed in the basement membrane of the bronchial epithe- 
lium, smooth muscle cells, skin, and Bowman capsule and tubular basement 
membrane of the kidney [8,15,22—26]. Throughout embryonic development, the 
ubiquitous alala2(IV) is replaced by a3a4a5(IV) and a5a5a.6(IV) in specific 
tissues in a developmental switch [24]. In contrast to the alala2(IV) network, 
the a3a4a5(IV) has a high degree of crosslinking and resistance to proteolysis 
[27] that may enable it to withstand increased stress in locations such as the 
mature glomerular and alveolar basement membranes. 

Inherited mutations in the genes encoding different type IV collagen a 
chains have been implicated in several severe diseases mainly showing cere- 
brovascular and renal manifestations. Mutations in the genes encoding the 
al(IV) and a2(IV) chains are mainly expressed as vascular defects resulting 
in, for example, intracerebral hemorrhages and intracranial aneurysms [28]. 
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Patients with hereditary angiopathy with nephropathy, aneurysm, and muscle 
cramps syndrome show mutations in the COL4A1 gene, causing a systemic 
phenotype due to the widespread distribution of the al(IV) chain and involv- 
ing both renal, vascular, and ocular manifestations [29]. Alport syndrome is 
caused by mutations in the a3(IV), a4(IV), or aS(IV) chain, resulting in the 
absence of the a3a4a5(IV) network. Several variations of Alport syndrome 
exist, but classically it consists of hematuria, proteinuria, progressive renal 
failure, and sensorineural deafness [30]. Patients retain a fetal distribution of 
the alala2(IV) network instead of switching to a304a5(IV) during develop- 
ment, leaving especially the glomerular basement membrane susceptible to 
proteolytic degradation [27]. Type IV collagen is the target of two autoim- 
mune diseases affecting the kidney: Goodpasture syndrome and Alport post- 
transplantation disease. Both diseases are characterized by autoantibodies 
attacking the glomerular basement membrane and causing rapidly progressive 
glomerulonephritis [31]. In addition, patients with Goodpasture syndrome 
present with lung hemorrhage as the autoantibodies also attack the alveolar 
basement membrane. These insults result from autoantibodies directed at 
cryptic epitopes in the NC1 domain of the a3(IV) chain, which are present in 
both the glomerular and alveolar basement membranes [30]. The cryptic sites 
are inaccessible in the native protein conformation and only become acces- 
sible to autoantibodies after the dissociation of the NCI hexamer caused by, 
for example, oxidative stress [32]. 

A specific 28-kDa peptide from the NC1 domain of the a3(I[V) chain 
known as tumstatin have anti-angiogenic properties and is able to suppress 
tumor growth [33]. Matrix metalloproteinase (MMP)-9 controls the physi- 
ological levels of tumstatin by cleaving it from type IV collagen in the base- 
ment membrane [33]. The inhibitory effects of tumstatin on tumor growth 
have been studied for several years. Tumstatin can directly bind to the tumor 
endothelium, and its anti-angiogenic action is carried out through the interac- 
tion with avß3 integrin on proliferating endothelial cells [34]. Furthermore, 
tumstatin levels in the airways of patients with asthma are reduced, which 
may contribute to disease progression as tumstatin suppresses airway angio- 
genesis, hyperresponsiveness, and inflammation [35]. The use of tumstatin 
as a therapeutic agent has been suggested for diabetic nephropathy [36] and 
cancer [37]. Tumstatin can also be a diagnostic, prognostic, and therapeutic 
marker of renal carcinoma [38]. Further investigations of fragments of type IV 
collagen have revealed that not only tumstatin but also several other peptides 
released from type IV collagen may exert biological activities that are differ- 
ent from those of the intact protein. Such peptides deriving from the ECM are 
known as matrikines. Matrikines derived from the NC1 domain of both the 
al (arresten), a2 (canstatin), a3 (tumstatin), a4 (tetrastatin), a5 (pentastatin), 
and a6 (hexastatin) chains of type IV collagen have been identified and shown 
to inhibit tumor progression by, for example, anti-angiogenic effects [33,39-41]. 
Interestingly, the work on matrikines indicates that fragments of type IV collagen 


34 Biochemistry of Collagens, Laminins and Elastin 


can be more than just products of disease, but may play an active role in sup- 
pressing or sustaining a pathological state. 


BIOMARKERS OF TYPE IV COLLAGEN 


The tissue-specific nature of the «3-a6(IV) chains introduces the opportunity 
to use selected peptides of type IV collagen as tissue-specific biomarkers of 
disease. Domains and fragments originating from specific a chains of type IV 
collagen released as a result of protein remodeling have been widely investi- 
gated for their use as biomarkers of various diseases of the ECM. In addition to 
the matrikines, specific fragments of type IV collagen produced by MMPs in 
relation to protein degradation have been investigated for their potential as bio- 
markers of fibroproliferative diseases. Circulating levels of an MMP-generated 
fragment of the a1 chain, C4M, have been associated with the presence of anky- 
losing spondylitis [42], liver fibrosis [43—45], portal hypertension [46], pancre- 
atic cancer [47], and chronic obstructive pulmonary disease (COPD) [48,49]. 
Interestingly, C4M has also been found to be elevated in serum of patients with 
COPD having an acute exacerbation compared to stable COPD, indicating 
an association with disease activity [49]. Elevated circulating levels of the 7S 
domain, believed to reflect type IV collagen formation, have been associated 
with fibrosis of the liver [50-52] and lung [53] as well as an increased mortality 
risk in patients undergoing hemodialysis [54]. 

Type IV collagen is also a biomarker target in urine of patients with different 
kidney disease manifestations. In type 1 diabetic nephropathy, the estimated glo- 
merular filtration rate declined more rapidly in patients with an elevated urinary 
type IV collagen-to-creatinine ratio (T4C), but with normal albumin-to-creat- 
inine ratio, than patients with normal T4C [55]. A proteomic study observing 
the levels of ECM protein fragments in patients with type | diabetes presenting 
a progressive early decline in renal function showed a decreased expression 
of urinary fragments of the a1(IV) chain compared with control subjects with 
stable renal function [56]. In type 2 diabetic patients, increased type IV collagen 
urine excretion was associated with the severity of morphological alterations 
in kidney fibrosis [57]. Moreover, elevated levels of urinary type IV collagen 
were correlated with urinary proteins, urinary N-acetyl-$-D-glucosaminidase, 
and selectivity index in a study on biopsy-proven membranous nephropathy 
and anti-neutrophil cytoplasmic antibodies—associated glomerulonephritis [58]. 


Type IV Collagen Description References 

Gene name and COL4A1 (13q34); NCBI gene ID: 

number COL4A2 (13q34); 1282, 1284, 
COL4A3 (2q36-q37); 1285, 1286, 


COL4A5 (Xq22); 


( 
( 
COL4A4 (2q35-q36); 1287, and 1288 
( 
COL4A6 (Xq22); 
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Description 


e COL4A1: familial porencephaly, intrace- 
rebral hemorrhage, hemorrhagic stroke, 
small-vessel disease, intracranial aneurysms, 
HANAC syndrome, retinal artery tortuosity. 

e COL4A2: familial porencephaly, intracere- 
bral hemorrhage, hemorrhagic stroke, small- 
vessel disease. 

e COL4A3: autosomal recessive Alport syn- 
drome, familial benign hematuria. 

e COL4A3/COL4A4: focal segmental glomeru- 
losclerosis and renal failure in thin basement 
membrane nephropathy. 

e COL4A4: autosomal dominant Alport syn- 
drome, familial benign hematuria. 

e COL4AS: X-linked Alport syndrome, diffuse 
leiomyomatosis. 

e COL4A6: diffuse leiomyomatosis, X-linked 
non-syndromic hearing loss. 

e COL4A1 KO, COL4A2 KO, and COL4A1/ 
COL4A2 double KO are embryonically lethal. 

e COL4A3 KO is a model for autosomal Alport 
syndrome. 

e alala2(IV): basement membranes of all tis- 
sues. 

e a3a4a5(IV): basement membranes of the 
glomeruli of the kidney; alveoli of the lungs, 
testis, inner ear cochlea, and eyes. 

e a5a5a6(IV): basement membranes of bron- 
chial epithelium, smooth muscle cells, skin, 
and Bowman capsule and tubules of the 
kidney. 

e Alport syndrome: loss of the a304a5(IV) 
network in kidneys and lungs. 

e Tumors: loss of 03-06 chains; loss or accumu- 
lation of a1 and a2, depending on tumor type. 

e Fibrosis: accumulation in liver and kidney, 
disruption in lung. 

N-terminal 7S domain; collagenous domain with 

21-26 interruptions; C-terminal NC1 domain 

e NC1 domain of a1 (arresten), a2 (canstatin), 
a3 (tumstatin), a4 (tetrastatin), a5 (penta- 
statin), and a6 (hexastatin). 

e 7S domain. 

e Degradation fragments (eg, C4M) 

Three a chains form the heterotrimeric 

protomers a1a1a2(IV), a3a4a5(IV), and 

a5a5a6(IV). Protomers associate via two NC1 
and four 7S domains to form lattice-shaped 
networks in basement membranes. 

Integrins, DDR1, nidogen, laminin, heparan 

sulfate proteoglycan, fibronectin, TGF-B, PDGF. 
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Type IV Collagen Description References 
Known central Underlies endothelial and epithelial cells, [8] 
function separates tissue compartments; surrounds 
various cells including smooth muscle and nerve 
cells; functions in cell adhesion, migration and 
development, tissue regeneration, and wound 
healing. 
Animals models e COL4A3 KO mouse model for autosomal [72,79,80] 
with protein Alport syndrome 
affected e COL4A4 splice site mutation resulting in 
mutant a4 chain expression in tissue as 
mouse model of autosomal Alport syndrome 
e COL4AS nonsense mutation resulting in 
male KO and female mosaic for a5(IV) chain 
expression as mouse model of X-linked 
Alport syndrome 
Biomarkers C4M, C4Ma3, P4NP 7S, 7S domain, NC1 ([33,39-41, 


domain (including arresten, canstatin, tumstatin, 43,44,51,52, 
tetrastatin, pentastatin, hexastatin). 81,82] 


COL, Collagen; DDR, discoidin domain receptor; HANAC, hereditary angiopathy with nephropa- 
thy, aneurysms, and muscle cramps; KO, knockout; NC, non-collagenous; NCBI, National Center 
for Biotechnology Information; PDGF, platelet-derived growth factor; TGF, transforming growth 
factor. 
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SUMMARY 

Type V collagen is a fibrillar collagen. Type V collagen is essential for fibrillation of types I 
and III collagen, and consequently for optimal fibrillary formation and tissue quality. Type V 
collagen contributes to the bone matrix; corneal stroma; and the interstitial matrix of muscles, 
liver, lungs, and placenta. Dysregulation of collagen fibrillogenesis is a hallmark of several 
subtypes of Ehlers—Danlos syndrome (EDS). Around 50% of patients with classic EDS 
harbor a heterozygous mutation in type V collagen, causing connective diseases as a result 
of decreased matrix quality. Collagen type V knockout mice synthesize and secrete normal 
amounts of type I collagen, but collagen fibrils are virtually absent and mice die at the onset 
of organogenesis, indicating that normal fibrillogenesis, such as that produced by type V 
collagen, is crucial. Several biomarkers of type V collagen formation and degradation exist, 
which have proven type V collagen to be of special importance in connective tissue diseases. 


Type V collagen is also a member of group I collagens known as fibrillar-forming 
collagens and is ubiquitously distributed throughout the body. Type V collagen 
binds to DNA, heparan sulfate, thrombospondin, heparin, and insulin. It is a key 
determinant in the assembly of tissue-specific collagen matrices such as those 
formed by types I and III. Type V collagen exists as a1, a2, and a3 chains, and each 
structure contains two propeptides, PSNP (at the N terminus) and P5CP (at the C 
terminus). The most abundant isoform of type V collagen is the [al(V)]2a2(V) 
heterotrimer [1]. During fibril formation, this heterotrimer is incorporated into 
type I collagen fibrils and in doing so increases the diameter of the fibrils by par- 
tial retention of the a1(V)-N-propeptide [1,2]. It has been proposed that the total 
type V collagen triple helix is buried within the fibril, in contrast to type I col- 
lagen, which is present along the fibril surface. The a1(V)-N-propeptide extends 
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outward from the gap zones of the heterotypic fibrils and is thus the only part of 
the type V collagen molecule that emerges at the surface of these fibrils [1]. 

A characteristic of type V collagen is the partial retention of the N-propep- 
tide extension. In its normal healthy state type V collagen is resistant to mam- 
malian collagenases and sensitive to trypsin [3]. It contributes to the organic 
bone matrix; corneal stroma; and the interstitial matrix of muscles, liver, lungs, 
and placenta [3]. Collagen knockout mice (col5a1 —/—) synthesize and secrete 
normal amounts of type I collagen, but collagen fibrils are virtually absent and 
mice die at the onset of organogenesis [4], indicating that normal fibrillogen- 
esis, such as that produced by type V collagen, is crucial. 

Ehlers—Danlos syndrome (EDS) is a group of heritable connective tissue 
disorders that share common features of skin hyperextensibility, articular hyper- 
mobility, and tissue fragility [5]. Dysregulation of collagen fibrillogenesis is 
a hallmark of several subtypes of EDS. Around 50% of patients with classic 
EDS harbor a heterozygous mutation in COLS5A/ or COLSA2, encoding the 
al(V)- and the «2(V)-collagen chains, respectively. The prevalence of classic 
EDS has been estimated to be 1:20,000 [6]. The diagnosis of the classic type is 
established by clinical examination and family history [5]. 

Six main descriptive types were substituted for earlier types numbered with 
Roman numerals: classic type (EDS I and EDS ID), hypermobility type (EDS 
HI), vascular type (EDS IV), kyphoscoliosis type (EDS VI), arthrochalasia type 
(EDS VIIA and VIIB), and dermatosparaxis type (EDS VIIC). The key features 
of classic EDS I and EDS II are joint laxity and fragile, atrophic scarring skin. 

Interactions between Schwann cells and neurons within the extracellular 
matrix are important for many aspects of peripheral nervous system develop- 
ment and function. Type V collagen has been associated with adhesions to hepa- 
ran sulfate in Schwann cells, inhibiting the outgrowth of axons from rat neurons 
and promoting Schwann cell migration. 


BIOMARKERS OF TYPE V COLLAGEN 


Few biomarkes for type V collagen exist, although novel markers have been 
developed to evaluate the role of type V collagen in fibrosis-related diseases 
(Column 5 of Table 1). The Pro-C5 marker is an excellent example of a serolog- 
ical marker developed specifically to reflect true type V collagen formation [7]. 
Fibril-forming collagens are synthesized as precursor molecules with propep- 
tide extensions at the N and C termini of the molecule. The mature propeptide is 
cleaved from procollagen by N-terminal or C-terminal proteinases, and mature 
collagen is integrated into the extracellular matrix [8,9]. The removal of the 
propeptide may be incomplete, leaving the propeptide attached to the molecule, 
thereby resulting in thin fibrils with abnormal crosslinks and susceptibility to 
rapid metabolic turnover [10,11]. Thus an internal procollagen epitope can be a 
marker of both formation and degradation. Pro-C5 solely measures type V col- 
lagen formation as that epitope is released when the propeptide is cleaved from 
the main protein during protein synthesis and incorporation into the matrix. The 
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measured epitope is generated during this protease cleavage and is consequently 
an N-positioned neoepitope of the native cleaved collagen [12]. Another novel 
marker for type V collagen is a marker used to evaluate the matrix metallopro- 
teinase degradation of type V collagen by using a monoclonal antibody specific 
toward a type V collagen fragment released into circulation in patients and in 
preclinical models [13]. Both the Pro-C5 and C5M markers have been shown 
to be correlated to portal hypertension in patients with chronic liver disease and 
have proven useful in preclinical models of liver disease [12,14,15]. C5M is also 
elevated in patients with ankylosing spondylitis compared with healthy controls 
[13] and decreased in patients with a hernia compared with controls [16]. 


Table 1 Type V 
Collagen 

Gene ID (NCBI) 
Collagen 
superfamily 
mutations with 
diseases in 
humans 


Null mutation in 
mice 


Tissue distribution 
in healthy states 


Tissue distribution 
in pathological 
affected states 
Special domains 


Special 
neoepitopes 


Description 
1289 (a1), 1290 (a2), 50509 (a3) 
Fibrillar collagen family 
Ehlers—Danlos syndrome type 1: 
COL5A1, gene MIM number 120215; 
COL5A2, gene MIM number 120190 
Ehlers—Danlos syndrome type 2: 
COL5A1, gene MIM number 120215; 
COL5A2, gene MIM number 120190 
Col5 a1 mutation mice synthesize and 
secrete normal levels of type | collagen, 
but collagen fibrils are absent and mice 
die at the embryonic stage 
Col5 a2 mutation mice survive 
poorly, maybe due to complications 
from spinal deformities, and exhibit 
skin and eye abnormalities 
1. colon, endometrium, skeletal 
muscle, lymph node, tonsil, thy- 
roid gland, lung, cornea, bone, 
fetal membranes 
2. NA: 
3. small intestines, testis, cervix, 
tonsil, parathyroid gland 
Found in interstitial tissue, associated 
with type I, dermis, tendon, cornea, 
lung fibrosis 
Signal peptide; TSPN-1 domain; VAR; 
COL1/COL2 domains; NC1 domain; 
NC2 domain; Laminin G-like 
domain; fibrillar collagen domain; 
VWFEC binding; collagen-like-1, -2, 
-3, -4, -5, -6 
G439-V in a1/a2 and G455-L in a3 
all by MMP-9; $254-Q and D1595-D 
by BMP-1; R1585-N by furin; 
P/A438-E by ADAMTS2 
a1: T1584-R; P924-R; a2: G355-Q, 
G229-Q; 03:G1317-H, G525-R 
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Type V Collagen Description References 
Protein structure Type V collagen has three varieties 
and function of a chains: a1 (COL5A1), a2 


(COL5A2), and a3 (COL5A3), 
forming a triple helix containing all 
three chains 


Binding proteins Type V collagen typically forms 


heterofibrils with types | and III 


collagens 
Known central Essential for fibrillation of types! and [3] [25] 
function III collagen 
Animals models Col5a1 and col5a2 absent [24,25] 
with protein 
affected 
Biomarkers Pro-C5, C5M [7,13] 


NCBI, National Center for Biotechnology Information; M/M, Mendelian Inheritance in Man; N.A., 
not applicable. 
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SUMMARY 

Type VI collagen is a unique beaded filament collagen. It is found in the interface between 
the basement membrane and interstitial matrix, where it forms a unique microfibrillar net- 
work. Type VI collagen has many binding partners. Mutations in type VI collagen have, 
in particular, been associated with muscle weakness disorders such as Bethlem myopathy 
and other muscle dystrophies. In addition to the structural and binding parameters of type 
VI collagen, the C-terminal propeptide is a hormone called endotrophin, associated with 
the metabolic syndrome. Consequently, type VI collagen is both a structural and a signal- 
ing protein. Furthermore, type VI collagen may act as an early sensor of the injury/repair 
response and may regulate fibrogenesis by modulating cell-cell interactions; stimulate 
the proliferation of mesenchymal cells; and prevent cell apoptosis. Biomarkers of type VI 
collagen formation and degradation have been developed, which have been shown to be 
regulated in connective diseases. 


Type VI collagen is a unique beaded filament collagen. Type VI collagen forms a 
unique microfibrillar network and is expressed in the extracellular matrix (ECM) 
of many tissues, including the dermis, skeletal muscle, lung, blood vessels, cor- 
nea, tendon, skin, cartilage, intervertebral discs, and adipose [1—4]. Type VI 
collagen is found near the basement membrane and has several binding sites in 
that membrane [5]. 

Type VI collagen is composed of three different chains: al, a2, a3. Each 
chain contains a short collagenous region and globular regions at both the N and 
C terminus [6]. Both al and a2 chains contain one N-terminal subdomain and 
two C-terminal subdomains, which are highly homologous to type A domains 
of von Willebrand factor (vWF-A) [7]. The «3 chain is much larger than the al 
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and a2 chains. It contains 10 N-terminal subdomains (N1I—N10), two C-terminal 
vWF-A-like subdomains (C1 and C2), and three specific C-terminal subdo- 
mains (C3-C5) [8]. Type VI collagen forms polymers intracellularly before 
secretion to the ECM. Dimers are formed by two monomers in an antiparallel 
way. Subsequently, the dimers form tetramers in a side-by-side parallel mode. 
Then, type VI collagen is secreted into the ECM where the microfibril forms 
[6,9]. Several subdomains in the N and C terminus are involved in the triple helix 
and microfibril formation. It is reported that the C1 domain plays critical roles 
in chain selection and assists the triple helix formation [8,10]. The N5 domain 
of the «3 chain is suggested to be involved in the assembly of microfibrils [11]. 
The C5 domain of the a3 chain, also called endotrophin, is important for mature 
microfibril formation [8] and is immediately cleaved off from the mature type 
VI microfibril after secretion [8,12]. The in vivo protease or possible prote- 
ases induce the release of endotrophin, and the cleavage sites are still unknown. 
However, it is suggested that the cleavage site is between the Cl and C2 
domains [8]. In one study, an increase of incorrectly folded type VI collagen was 
found surrounding adipocytes in matrix metalloproteinase MMP-11-deficient 
mice [13]. Therefore, MMP-11 is suggested to be involved in the C5 cleavage 
process. However, this speculation needs to be verified. 

Recently, three more chains, a4, a5, and a6, were discovered that are 
homologous to the a3 chain [14-16]. The a4 chain is absent in humans due 
to the disruption by a chromosome break in the COL6A4 gene [16]. All three 
new chains are suggested to potentially substitute for the a3 chain in type VI 
collagen [15,17]. However, this substitution is still debated because a5 and a6 
chains are also expressed in the regions where al, «2, and a3 chains are not 
expressed [16]. 

Type VI collagen binds to a wide range of ECM proteins, including type 
IV collagen [5], decorin [18], biglycan [19], perlecan [20], NG2 proteoglycan 
[21], fibronectin [22], tenascin [23], and integrin [24]. It also associates with 
collagen fibrils, potentially by mediation of proteoglycans [25,26]. In addi- 
tion, a series of cells including fibroblasts, chondrocytes, hematopoietic cells, 
and tumor cells can bind to type VI collagen through the cell binding sequence 
Arg-Gly-Asp in the triple helix domain [27-30]. It is widely accepted that 
type VI collagen aids cell attachment and connects related tissues to the sur- 
rounding matrix [3,31]. Except for the connection functions, type VI collagen 
may act as an early sensor of the injury/repair response and may regulate 
fibrogenesis by modulating cell-cell interactions. It could stimulate the 
proliferation of mesenchymal cells and prevent cell apoptosis [32,33]. The 
persistent activation of mesenchymal cells into myofibroblasts could eventu- 
ally induce ECM deposition and tissue fibrosis [34]. In lung fibrosis, type VI 
collagen is found to be increasingly expressed [35]. The newly found a6 chain 
is upregulated in fibrotic areas of Duchenne muscular dystrophy muscle [36]. 
In addition, endotrophin can trigger adipose tissue fibrosis [37]. It also con- 
tributes to inflammation and angiogenesis, thereby facilitating tumor growth 
and progression [38]. 
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Mutations COL6A1, COL6A2, and COL6A3 usually cause muscle weak- 
ness disorders such as Bethlem myopathy, Ullrich congenital muscular dys- 
trophy, limb-girdle muscular dystrophy phenotype, and autosomal recessive 
myosclerosis [39-41]. Type VI collagen functions have been further investi- 
gated in knockout mice. Triple helix type VI collagen is absent in Col6al 
knockout mice. Fiber necrosis and pronounced variations in fiber diameter are 
found [42], which demonstrates that type VI collagen helps maintain the integ- 
rity of skeletal muscle. In addition to muscular damage, Col6al knockout mice 
also show accelerated development of osteoarthritis [43]. In another study, type 
VI collagen knockout has protective roles after myocardial infarction, poten- 
tially by limiting chronic apoptosis, fibrosis, and scarring [44]. 


BIOMARKERS OF TYPE VI COLLAGEN 


Due to the specific roles of type VI collagen in the ECM environment, type VI 
collagen fragments generated during degradation or formation could be used as 
potential biomarkers for different disorders. Degradation fragment of type VI 
collagen induced by MMP-2 and MMP-9 (C6M) is highly increased during liver 
fibrosis [45] and exacerbations of chronic obstructive pulmonary disease (COPD) 
[46]. Pro-C6 (type VI collagen fragment generated during formation) is decreased 
during exacerbations of COPD [46]. In a human immobilization—remobilization 
study, C6M levels after 2 weeks of immobilization were a negative biomarker for 
muscle hypertrophy during retraining [47]. In 56-day bed rest study, Pro-C6 is 
related to the anabolic and catabolic responses to unloading and reloading [48]. 


Type VI Collagen Description References 
Gene name and COL6A1, location 21q22.3 (gene ID 1291) NCBI 
number COL6A2, location 21q22.3 (gene ID 1292) NCBI 
COL6A3, location 2q37 (gene ID 1293) NCBI 
COL6A4 (pseudogenes due to a chromosome 
break) 
COL6AS, location 3q22.1 (gene ID 256076) NCBI 
COL6A6, location 3q22.1 (gene ID 131873) NCBI 
Mutations with Bethlem myopathy (BM) [39-41] 


diseases in humans Ullrich congenital muscular dystrophy (UCMD) 
Limb-girdle muscular dystrophy phenotype 
Autosomal recessive myosclerosis 
Null mutation in In Col6A1 knockout mouse, triple helix-type VI [42-44] 
mice collagen was absent. Fiber necrosis, pronounced 
variations in fiber diameter, and accelerated 
development of OA are found. However, type 
VI collagen knockout has protective roles after 
myocardial infarction. 


Tissue distribution in Dermis, skeletal muscle, lung, blood vessels, [1-4] 
healthy states cornea, tendon, skin, cartilage, intervertebral 

discs, adipose 
Tissue distribution Upregulated in tissue fibrosis [35,36] 


in pathologically 
affected states 
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Type VI Collagen Description References 
Special domains Soluble cleaved C5 domain, also called (8,12,37,38] 
endotrophin 
Special neoepitopes The cleavage site between the C1 and C2 [8] 
domain of the a3 chain can release a soluble 
C-terminal fragment, endotrophin. However, the 
cleavage site is still unknown. 
Protein structure and Type VI collagen is composed of three different [1,3,6] 
function chains: a1, a2, and a3. The protein contains a 
short collagenous region and globular regions at 
both the N and C termini. Dimers are formed by 
two monomers in antiparallel manner, whereas 
tetramers are formed in a parallel manner. After 
secretion into the ECM, tetramers are assembled 
into a microfibrillar network. Type VI collagen is 
expressed in many tissues and connects related 
tissues to the matrix. 
Binding proteins Type IV collagen, biglycan, decorin, perlecan, [5,18,21-24] 
NG2 proteoglycan, fibronectin, tenascin, 
integrin. 
Known central Helps cell attachment and connects related [3,5,31] 
function tissues to the surrounding matrix 
Animal models with COI6A1 knockout mouse model [42] 
protein affected 
Biomarkers of that Pro-C6, C6M [45-48] 


protein 


NCBI, National Center for Biotechnology Information; OA, osteoarthritis; ECM, extracellular matrix. 
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SUMMARY 

Type VII collagen (COLVID is primarily synthesized by keratinocytes and fibroblasts. 
It is crucial for the function and stability of the extracellular matrix (ECM) as it is an 
anchoring fibril collagen, where it binds type I and type IM collagen, providing stability 
to the interstitial membrane and the basement membrane ECM structures. The NC1 can 
be subdivided into smaller components comprising adhesive proteins with high-binding 
affinity to basement membrane proteins, especially laminin-5 and laminin-6 and type IV 
collagen. The main mutation investigated for COLVII is epidermolysis bullosa, a severe 
skin disorder. COLVII has also been reported to be involved in autoimmune diseases such 
as systemic lupus erythematosus, Sjögren syndrome, and systemic sclerosis. The function 
and roles of COLVII are emerging, and no biomarkers are currently developed. 


Type VII collagen (COLVII) is primarily synthesized by keratinocytes and fibro- 
blasts. COLVII is approximately 424nm long [1]. The designated gene, named 
COL7A1, is located on chromosome 3. COLVII is strongly associated with the 
epithelia segment of tissues, especially the dermis. However, COLVII has also 
been reported to be present in other tissues including those of the rectum, colon, 
small intestines, esophagus, oral mucosa, cervix, placenta, skeletal muscle, and 
cornea [1,2]. 

COLVII is crucial for the function and stability of the extracellular matrix 
(ECM) as it is an anchoring fibril collagen [1]. COLVIU consists of three com- 
ponents, the N-terminal noncollagenous-1 (NC1) part (~145kDa), the triple 
a-chain helix collagenous domain, and the C-terminal noncollagenous-2 (NC2) 
part (~30 kDa) (Fig. 3d, Introduction) [3-6]. NC1 is the major component of 
COLVII and determines the ECM function due to its role as an anchoring fibril 
[1]. The NC1 can be subdivided into smaller components comprising adhesive 
proteins with high binding affinity to basement membrane proteins, especially 


Biochemistry of Collagens, Laminins and Elastin. http://dx.doi.org/10.1016/B978-0-12-809847-9.00007-6 
Copyright © 2016 Elsevier Inc. All rights reserved. 57 


58 Biochemistry of Collagens, Laminins and Elastin 


laminin-5 and -6 and type IV collagen. These adhesive proteins are mainly com- 
prised of segments with homology with von Willebrand factor, cartilage matrix 
proteins, and fibronectin type III domains [7,8]. 

For COLVII to be a fully functional anchoring fibril, two COLVII will attach 
at the C-terminal end to form a U-shaped structure (Fig. 4, Introduction) [9]. The 
U-shaped structure is now a COLVII dimer that functions as a trap for the intersti- 
tial collagen fibrils in the triple helix of COLVIL For the NC2 domains to assem- 
ble and form the U-shaped COLVII, the NC2 domain is removed by proteases. 
Moreover, the NC2 domain also includes a segment with homology to the Kunitz 
protease inhibitor [10], thus stopping the proteolytic cleavage of the C terminus. 
The interstitial type I, III, and V collagens are the major targets of entrapment by 
the U-shaped COLVII. In addition, the NC1 domain will bind basement mem- 
brane proteins, especially laminins and type IV collagens, and together with the 
U-shape formation facilitated by the NC2 domain, COLVII provide stability to 
the interstitial membrane and the basement membrane ECM structures [7,8]. 

COLVII mutations that lead to defects and loss of its function have critical 
impacts on the integrity of the ECM and thus of the tissue [11,12]. The skin dis- 
order dystrophic epidermolysis bullosa (DEB) is directly related to mutations in 
the COL7A1 gene [11], whereas in epidermolysis bullosa acquisita (EBA), auto- 
antibodies are generated against COLVII [2,9]. Hence, COLVII is also referred 
to as the epidermolysis bullosa acquisita-antigen [2]. Mutations affecting both 
alleles of the COL7A1 gene result in the absence of the COLVII protein [13]. 
The disease manifestations are similar in both DEB and EBA, which are primar- 
ily blistering and scarring of the skin [2,11]. 

EBA has also been linked to inflammatory bowel disease, especially Crohn dis- 
ease (CD). In 40% of all CD patients autoantibodies generated against COLVII have 
been found to be present in the serum [2]. Furthermore, CD can evolve so that EBA 
will manifest in some patients. Circulating autoantibodies against COLVII have 
been linked to other autoimmune diseases including systemic lupus erythematosus 
and Sjögren syndrome [14]. Recessive dystrophic epidermolysis bullosa (RDEB), 
is a type of DEB, but lacks collagen type VII epitopes. Interestingly RDEB has 
also been associated with aggressive squamous cell carcinomas. These cancer forms 
usually develop early in life and have very high metastatic rates, which drastically 
decrease the life span of affected patients. COLVII has also been shown to be upreg- 
ulated in cancer stem cells cultured as spheroids in vitro [15]. 

Increased expression and accumulation of COLVII in the skin of patients 
with systemic sclerosis (SSc), and accumulation of COLVII in SSc patients was 
directly regulated by transforming growth factor-B [16]. Moreover, it has been 
shown that matrix metalloproteinases (MMPs) can cleave COLVII [17] and gen- 
erate fragments that can be used to develop neoepitope biomarker assays. COLVII 
neoepitope assays could potentially add valuable information in detecting and 
managing diseases, such as DEB, EBA, and CD, where COLVII is affected 
since the neoepitopes may reflect chronic MMP activity that will continuously 
degrade COLVII. Furthermore, these fragments could be the potential triggers 
for generating antibodies against COLVII. 
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Collagen Type VII Description References 
Gene name and number COL7A1 {1,12] 
Mutations with diseases  Dystrophic epidermolysis bullosa [2] 

in humans 

Null mutation in mice COLVII knockout mice presented with severe [12] 


blistering of skin at birth, which resembles 
dystrophic epidermolysis bullosa in humans 


Tissue distribution in Skin, rectum, colon, small intestines, [1,21 
healthy states esophagus, oral mucosa, cervix, placenta, 

skeletal muscle, and cornea 
Tissue distribution in Skin is affected in dystrophic epidermolysis [2,14-16] 
pathologically affected bullosa as well as in 40% of Crohn disease 
states patients. COLVII has also been reported 


to be involved the autoimmune diseases 
eg, systemic lupus erythematosus, Sjogren 
syndrome, and systemic sclerosis. 

Special domains The NC1 is located in the N terminus, in [5,6] 
the collagenous domain containing the 
triple helix COLVII formation and in the 
C-terminal NC2 domain 


Special neoepitopes Interstitial MMP cleavage of COLVII [17] 
Protein structure and The NC1 domain has high affinity to proteins [11] 
function of the basement membrane, such as laminins 


and COLIV. The collagenous triple a-chain 

helix domain entraps interstitial collagens. 

The NC2 domain is responsible for dimer 

formation of COLVII, which facilitates 

entrapment of other collagens 
Binding proteins NC1 (von Willebrand factor) [3-6] 
Known central function Attachment to the basement membrane via [7] 

the NC1 domain, and interstitial collagen 


entrapment 
Animals models with The COLVII knockout mouse is a model for [12] 
protein affected dystrophic epidermolysis bullosa 
Biomarkers Autoantibodies against COLVII [18] 


COL, Collagen; MMP, matrix metalloproteinase. 
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SUMMARY 

Type VII collagen is a short-chain, nonfibrillar collagen and the major component of the 
Descemet’s membrane. Type VIII collagen is produced by endothelial cells and forms 
unique hexagonal lattice structures. It is found in the heart, brain, liver, lung, and muscles 
and around chondrocytes in cartilage. Furthermore, type VIII collagen is found around 
actively proliferating vessels of brain tumors and in large fibrotic vessels of angiomas. 
Mutations are involved in Fuchs endothelial dystrophy. Type VIII collagen both holds 
structural and signaling properties, as exemplified by vastatins potent inhibitory effects 
on angiogensis. There are currently both N- and C-terminal antibodies incorporated in 
enzyme-linked immunosorbent assays for type VIII collagen. 


Type VIII collagen is a short-chain, nonfibrillar collagen. It is the major component 
of the Descemet membrane—the basement membrane separating corneal endothe- 
lial cells from corneal stroma—of corneal endothelial cells. It is part of the endothe- 
lium of blood vessels and is present in arterioles and venules; thus, it is found in the 
heart, brain, liver, and lung and around chondrocytes in cartilage [1]. The human a1 
procollagen gene is located on chromosome 3, whereas the human a2 procollagen 
gene is located on chromosome 1. Each a chain has a molecular mass of approxi- 
mately 60kDa [2]. Previously, type VII collagen was described as a heterotrimer 
comprised of two al chains and one a2 chain [3], but in vitro studies have shown 
that homotrimers of either a1 or a2 are formed [4]. These homotrimers are pepsin 
resistant and an immunohistochemistry study showed they did not always colocal- 
ize in the cornea, optic nerve, aorta, and umbilical cord [5]. 

The al chain is 744 amino acids (aa) long, with the first 27 aa in the N-terminal 
being a signal peptide. The NC1 domain of type VIII collagen (aa 572-744) is 
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a protein named vastatin, which has been shown to be a potent angiogenesis 
inhibitor with apoptotic-inducing activity on aortic endothelial cells [6]. The 
a2 chain is 703 aa long, with a signal peptide in the N-terminal at aa 1-28. The 
al and the a2 procollagen genes each contain four exons. One of the charac- 
teristic features is the largest exon encodes the entire triple helical (COL1) and 
the C-terminal nontriple helical (NC1) domains. Type VIII collagen shares a 
high sequence homology with type X collagen along with a similar intron—exon 
structure. This suggests that both collagens arose from the same precursor gene 
and belong to the same collagen subclass [7—9]. Type VIII collagen is able to 
form hexagonal lattice structures (Fig. 4, Introduction) like type X collagen. 

Sequence analysis has shown that each a chain contains a collagenous triple 
helical domain of 454 aa, a long NC1 domain, and a shorter NC2 domain. Type 
VII collagen is a glycoprotein that is very susceptible to neutrophil elastase and 
is completely degraded within 4h, unlike other vascular collagens such as type 
I, II, IV and V collagen [10]. When incubated with pepsin, it will form two pep- 
sin-resistant fragments, one fragment from each chain, of around 50kDa [11]. 

Type VIII collagen is synthesized by aortic and corneal endothelial cells, 
pulmonary artery endothelial cells, and microvascular endothelial cells. Not all 
endothelial cells express type VIII collagen, since it is absent from large and 
small vessels [12]. In addition, it has been found that monocytes and macro- 
phages express type VIII collagen in vitro and in vivo [13]. Human mast cells 
have also been shown to produce type VII collagen under normal and patholog- 
ical conditions. It has been shown to contribute to angiogenesis, tissue remod- 
eling, fibrosis, and cancer [14-18]. Mast cells expressing type VIII collagen 
are found in perivascular spaces [19]. Type VIII collagen is also expressed by 
smooth muscle cells where it stimulates cell migration [20]. 

Corneal endothelium secretes type VIII collagen, which is assembled into a 
hexagonal lattice structure in the Descemet membrane by the interactions of a1 
and a2 polypeptides. Mutations in the COLSA2 gene are associated with early 
Fuchs endothelial corneal dystrophy, but there is no correlation with COLSA/ 
[21]. It is believed that type VIII collagen is involved in endothelial differen- 
tiation and organization [22]. In cardiac development type VII collagen plays 
an important role in vascularization where it has been immunolocalized to the 
subendothelium of capillaries and small arteries [23,24]. Type VIII collagen is 
upregulated in early atherogenesis and is suggested to be involved in thrombosis 
and monocyte infiltration. It is accumulated in atherosclerotic lesions, and its 
distribution patterns imply a role in plaque stabilization. Type VIII collagen is 
found around actively proliferating vessels of brain tumors and in large fibrotic 
vessels of angiomas [25]. Lastly, type VIII collagen is expressed in human dia- 
betic nephropathy, but not in other renal diseases [26]. 


BIOMARKERS OF TYPE VIII COLLAGEN 


Commercially available antibodies for the N- and C-terminal of type VIII 
collagen exist, but often the epitope is unspecified. The majority of the antibodies 
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are polyclonal; hence, they have a higher cross-reactivity toward nonspecific 
antigens. The available monoclonal antibodies are, in general, specific for a 
large aa sequence. Many diseases such as fibrosis and cancer have a high tis- 
sue turnover, ie, there is an increased formation and degradation of proteins, 
and possibly type VIII collagen. It is highly plausible that type VII collagen is 
cleaved by collagenases during tissue remodeling, and a monoclonal antibody 
specific for a special protease—generated fragment, or type VII collagen telo- 
peptides would assist in the discovery and quantification of the role of type VII 
collagen in health and disease. 


Type VIII Collagen Description References 
Gene name and COL8A1 (3q11.1-q13.2), COL8A2 [27,28] 
number (1p34.2-p32.3) 
Mutations with Fuchs corneal dystrophy [29] 
diseases in humans 
Null mutation in mice COl8a1 KO and COl8a2 KO develop [30] 
normally and have a normal life span. 
Col8a1~-/COl8a2-~ mice have 
dysgenesis of the anterior segment of 
the eye. 
Tissue distribution in Bone, brain, cartilage, eye, heart, [31] 


healthy states 


Tissue distribution in 
pathological affected 
states 


kidney, liver, lung, muscle, skin, spleen, 
vascular tissues, ligaments and tendons, 
nerves 

Upregulated in atherosclerosis 

and many tumors due to actively 
proliferating vessels. Expressed by mast 
cells in fibrotic tissues and contributes 
to the fibrotic changes in diabetes and 
diabetic nephropathy. Involved in Fuchs 
endothelial dystrophy 


[12,19,25,26,32-34] 


Special domains NC1 domain of «1 is known as vastatin, [6,35] 
a potent angiogenesis inhibitor with an 
apoptotic effect on aortic endothelial cells. 
Special neoepitopes Cleaved by neutrophil elastase into two [36] 
50K (M,) fragments; 50K-A (a1) and 
50K-B (a2) 
Protein structure and Heterotrimer comprised of two a1 [7] 
function chains and one a2 chain. Increasing 
evidence suggests homotrimers of each. 
Each chain has an NC1, NC2, and triple 
helical domain. 
Binding proteins a1f1-integrin and GPla/Ila (26 1- [20,37] 
integrin) 
Known central Vascular remodeling and maintenance [26,38] 
function of vessel wall integrity and structure 
Animals models with COI8 KO mice, Aca23 mice (missense [30,39] 


protein affected 
Biomarkers 


mutation) 
N- and C-terminal markers 


COL, Collagen; KO, knockout. 
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SUMMARY 

Type IX collagen is a fibril-associated collagen with interrupted triple helices. Type IX col- 
lagen is present in the chondrocytes of growth-plate cartilage, adult articular, cartilage and 
intervertebral discs. Mutations in type IX collagen can predispose individuals to multiple 
epiphyseal dysplasia, a clinically highly heterogeneous skeletal disorder, with early-onset 
osteoarthritis as a very common manifestation. Type IX collagen contributes to the stabili- 
zation of the fibrillar collagen network in the cartilage matrix and the anchorage of matrilin 
3 and proteoglycans, which controls the diameter of collagen fibrils; consequently, it is an 
essential part of articular cartilage. There are currently no biomarkers of type IX collagen, 
which is needed to further elucidate the role of type IX collagen in health and disease. 


Type IX collagen is a fibril-associated collagen with interrupted triple helices. 
Each type IX collagen molecule consists of three chains: one al chain, one a2 
chain, and one «3 chain; each of these chains is encoded by three genes: COL9A1 
(gene ID: 1297), COL9A2 (gene ID: 1298), and COL9A3 (gene ID: 1299), 
respectively [1]. Studies suggest that mutations identified in COL9AI [2], 
COL9A2 [3], and COL9A3 [4,5] can predispose individuals to multiple epiphyseal 
dysplasia (MED), a clinically highly heterogeneous skeletal disorder with 
early-onset osteoarthritis (OA) as a very common manifestation. Mice with a 
completely inactivated COL9A1 gene showed no obvious phenotypic alterations 
at birth, but thereafter they had a degenerative joint disease resembling human 
OA at 4months or older [6]. Transgenic mice lacking type IX collagen and 
cartilage oligomeric protein are likely to experience cartilage degeneration, which 
is thought to be mediated by the anchorage of matrilin 3 and proteoglycans. Type IX 
also controls the diameter of collagen fibrils [7]. 
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Type IX collagen is present in the chondrocytes of growth-plate cartilage, 
adult articular cartilage, and intervertebral discs. Immunostaining of type IX 
collagen in normal articular cartilage showed it is concentrated in the pericel- 
lular matrix, whereas a weaker intensity appeared in the territorial matrix [8]. 
In OA tissue higher contents of type IX collagen and its mRNA were found in 
the pericellular matrix of the weight-bearing areas adjacent to the defects in 
cartilage [9]. In addition, type IX collagen was reported to be distributed at the 
edge of fibrillation and fissure in cartilage, suggesting that type IX collagen 
remodeling is associated with the degeneration of cartilage in OA [8]. 

All three polypeptide units of type IX collagen, unlike those in fibrillar- 
forming collagens, contain three triple-helix domains (COL1, COL2, and 
COL3) interspersed with four non-triple-helix domains (NC1, NC2, NC3, and 
NC4). Another difference is the longer globular structure of NC4 flanked at 
the N terminus by al chain [10] in type IX. Type IX collagen is located on the 
surface of collagen fibrils where, in combination with types II and XI collagens, 
it forms the unique heterofibril network in the matrix of cartilage [11]. NC1 and 
NC3 contain cysteinyl residues involved in interchain disulfide bond forma- 
tion. A covalent crosslinking of the NC-1 domain in type IX to type II collagen 
in cartilage has been detected previously [12]. The N-terminal helical region 
of the COL2 domain of type IX collagen provides covalent bond sites to the 
N-telopeptide of a1(II), whereas a covalent bond from the region of COL2 (IX) 
forms with the C-telopeptide of a1(1) [13]. COL3 and NC4 domains project 
away from the collagen fibril, and the NC4 domain provides sites for interac- 
tions with other matrix components such as glycosaminoglycans and cartilage 
oligomeric matrix protein (COMP) [14], fibronectin [15], and fibromodulin and 
osteoadherin [16]. In addition, COL3 and NC4 domains bind to C4, C3 and C9, 
forming a novel inhibitor of the complement system [17]. 

Type IX collagen fulfills important functions for cartilage integrity and 
stability. It also influences the diameter of fibrils through its interaction with 
COMP at the NC4 domain [7]. In a rabbit model type IX collagen content was 
significantly reduced in femoral and tibial cartilage at 6 days after antigen oval- 
bumin-induced rheumatoid arthritis [18]. The immunohistochemical distribu- 
tion of type IX collagen changed in spontaneously osteoarthritic canine tibial 
cartilage compared with normal cartilage [8]. These animal models suggest that 
type IX collagen alterations might be implicated in the pathogenesis of rheuma- 
toid and osteoarthritic arthritis. 


BIOMARKERS OF TYPE IX COLLAGEN 


Type IX collagen is susceptible to enzymatic degradation. The loss of type IX 
collagen at the surface of collagen fibrils may make the underlying cartilage 
vulnerable to proteolytic processing. Release of type IX collagen has been 
investigated in cartilage in vitro, ex vivo, and in in vivo destruction models. The 
degradation of type IX collagen was conducted by matrix metalloproteinase 
(MMP)-3 in vitro and at cleavage sites between 779S and 780L within the NC2 
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domain [19]. Two major cleavage sites at 258R—259I and 400G—-401T, which 
bound with MMP-13, were identified within the NC4 and the COL3 domains, 
respectively, in interluekin-1—treated bovine nasal cartilage [20]. Antibodies 
directed against these neoepitopes might be used to detect degradation caused 
by proteolytic enzymes in the early stages of OA and rheumatoid arthritis. 

Taking all evidence together, type IX collagen is a minor, but crucial, com- 
ponent of collagen fibrils in the cartilage matrix. Turnover of type IX colla- 
gen by proteases is an early event in degenerative joint disease. Neoepitopes of 
type IX collagen degradation might be potential biomarkers for diagnosing and 
assessing response to treatment in degenerative arthritis. 


Type IX Collagen Description References 
Gene name and number COL9A1(gene ID: 1297), COL9A2 (gene ID: [1] 
1298), and COL9A3 (gene ID: 1299) 


Mutations with diseases Predispose for MED phenotypes 2-5] 
in man 
Null mutation in mice Showed no alterations at birth, but associated [6,7] 


with degenerative joint disease resembling 

human osteoarthritis at 4 months or older 
Tissue distribution in Type IX collagen is present in chondrocytes 8] 
healthy states of growth-plate cartilage, adult articular 

cartilage, and intervertebral discs 

Immunostaining of normal articular cartilage 

showed type IX collagen is concentrated in 

the pericellular matrix, whereas a weaker 

intensity appeared in the territorial matrix 


Tissue distribution in In the pericellular matrix of the weight- [8,9] 
pathologically affected bearing areas adjacent to cartilage defects 

states and at the edge of fibrillation and fissure 

Special domains Three triple-helix domains and four [12-17] 


non-triple-helix domains, providing 
covalent crosslink sites and, especially at 
the NC4 domain, binding to other matrix 
components 
Special neoepitopes In vitro, MMP-3 cleaves to a site between [19,20] 
779S and 780L within the NC2 domain 
Two major cleavage sites at 258R-2591 
and 400G-401T bound by MMP-13 were 
identified within the NC4 and COL3 
domains, respectively, in IL-1-treated 
bovine nasal cartilage 
Protein structure and Each type IX collagen molecule consists of — [1,10,11] 
function three chains. All three chains contain three 
triple-helix domains interspersed with four 
non-triple-helix domains. Type IX collagen 
in combination with types Il and XI collagen 
form the unique heterofibiril network in the 
matrix of cartilage 
Binding proteins Type II collagen, COMP, fibronectin, {12,14-17] 
fibromodulin, and osteoadherin [16]; also, 
type IX binds to C4, C3, and C9 to forma 
novel inhibitor of the complement system 
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Type IX Collagen Description References 
Known central function Contributes to the stabilization of the [7] 


fibrillar collagen network in the cartilage 
matrix and the anchorage of matrilin 3 and 
proteoglycans. It controls the diameter of 
collagen fibrils 


Animals models with Ovalbumin-induced rheumatoid arthritis [8,18] 
protein affected in a rabbit model showed significantly 


reduced type IX collagen content in femoral 
and tibial cartilage. Immunohistochemistry 
showed the distribution of type IX collagen 
was different in spontaneously osteoarthritic 
canine tibial cartilage compared with 
healthy tissue 


Biomarkers A typically used reliable biomarker of type 


IX collagen has not been reported 


COL, Collagen; MED, multiple epiphyseal dysplasia; MMP, matrix metalloproteinase; IL, interleu- 
kin; COMP, cartilage oligomeric matrix protein. 
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SUMMARY 

Type X collagen is a network-forming collagen. Type X collagen is mainly expressed in 
hypertrophic chondrocytes in cartilage, with expression usually limited to the hypertro- 
phic zone of the growth plate and in the calcified zone of articular cartilage of long bones, 
where type X collagen seems to facilitate calcification. More than 30 different mutations 
have now been characterized. The most common mutation is Schmid metaphyseal chon- 
drodysplasia. There are emerging biomarkers of type X collagen, which have been shown 
to be elevated in rheumatological disorders affecting bone and cartilage. 


Type X collagen is a network-forming collagen. Type X collagen is mainly 
expressed in hypertrophic chondrocytes in cartilage. The expression is usually 
limited to the hypertrophic zone of the growth plate and in the calcified zone of 
articular cartilage of long bones. In healthy conditions of adults type X collagen 
constitutes about 1% of the total amount of collagen in cartilage [1]. Type X col- 
lagen is coded by the gene COL10A1 [2], and it is a nonfibrillar collagen consist- 
ing of a homotrimer of three a1(X) chains [2]. Each chain has three domains: a 
short triple helix of 463 amino acids (aa), flanked by a highly conserved noncol- 
lagenous (161-aa) globular domain (NC1) at the carboxyl end, and a short (38- 
aa) nonhelical amino-terminal domain (NC2) [3]. This structure is susceptible to 
interstitial collagenase and gelatinase cleavage at at least two distinct sites within 
the triple-helix domain, generating three peptides of 32-, 18-, and 9-kDa chains. 
The 32-kDa peptides are present in a triple helical conformation. The 18-kDa 
peptide contains the tyrosine-rich globular domain of the molecule. The 9-kDa 
peptide is derived from the triple helical end of the native molecule [4]. Human 
neutrophil elastase [5] and pepsin and trypsin [6] have been identified to cleave 
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collagen X as well. The sequence of human type X collagen shows a high degree 
of similarity with those of bovine, chicken, and mouse, which indicates a high 
conservation of the COL10A1 gene throughout evolution [3,7,8]. 

Increased extracellular deposition and expression are found in joint degen- 
erative diseases as the chondrocytes become hypertrophic [3,9,10]. Multiple 
upstream negative regulatory mechanisms act to restrict transcription of type X 
collagen to hypertrophic chondrocytes [11]. In contrast type X collagen consti- 
tutes up to 45% of the total collagen produced by mature hypertrophic chondro- 
cytes, and it is therefore a major protein product secreted by this cell type [3]. 
Observations of immunostained cephalic sternal sections from chick embryos 
suggest that type X collagen is localized in a capsule-like configuration sur- 
rounding each hypertrophic chondrocyte [12]. Interestingly, the gene expres- 
sion of type X collagen in late-stage osteoarthritic cartilage is not significantly 
upregulated according to findings by Brew et al. [13]. A possible explanation 
for this could be that some chondrocytes in early osteoarthritis become hyper- 
trophic and produce type X collagen preceding apoptosis. Thus, type X collagen 
is no longer represented in the gene expression of late osteoarthritic tissue [13]. 

The biological function of type X collagen seems to be to facilitate calcifica- 
tion and to play a role in the normal distribution of matrix vesicles and proteogly- 
cans within the growth plate [14]. Type X collagen is considered to be of major 
importance in endochondral bone growth and development, although the precise 
function the protein plays and its role in ossification have remained unresolved 
[3]. Collagen X deficiency impacts the supporting properties of the growth plate 
and the mineralization process, resulting in abnormal trabecular bone [14]. 


BIOMARKERS OF TYPE X COLLAGEN 


Type X collagen serves as a marker of the terminally differentiated, hyper- 
trophic chondrocytes. Gene transcription and translation products are thereby 
used for determination of chondrocyte growth and differentiation [15]. The bio- 
marker C-10C measures significantly increased levels of type X fragments in 
the serum of patients with osteoarthritis compared with healthy controls. C-10C 
is specific for fragments of the sequence SFSGFLVAPM, which corresponds to 
the C-terminal site of the NC1 domain [16]. 

Inactivation of the COL10A1 gene in mice was first reported not to affect the 
phenotype, which suggests that only mutated versions of the gene give rise to 
skeletal problems [2]. It has since been found that mice deficient in type X col- 
lagen experience phenotypic consequences including abnormal trabecular bone 
architecture and coxa vara [14]. More than 30 different mutations of COLIOA1 
have been identified. All are characterized by alteration of the structure of the 
C-terminal NC domain of the polypeptide. This observation suggests that the 
mutant chains are unable to associate to form triple helical molecules. This sce- 
nario occurs in Schmid metaphyseal chondrodysplasia, a disease characterized 
by shortening of the legs, which is aggravated by walking [17]. 
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Type X Collagen Description References 

Gene name and number —=COL10A1-collagen, type X, alpha Gene ID: 1300 

16q21-q22 

Mutations with diseases More than 30 different mutations [17] 

in humans have now been characterized. The 

most common is Schmid metaphyseal 
chondrodysplasia 

Null mutation in mice Abnormal trabecular bone, coxa vara, [14] 

reduction in thickness of the growth 
plate resting zone and articular cartilage, 
altered bone content and atypical 
distribution of matrix components 
within the cartilage growth plate 

Tissue distribution in Hypertrophic zone of the growth plate 1] 

healthy states and basal calcified zone of articular 
cartilage 

Tissue distribution in Increased expression in osteoarthritis 10] 

pathologically affected when chondrocytes become 

states hypertrophic 

Special domains Triple helix, NC1 and NC2 18 

Special neoepitopes N and C termini 19,20] 

Protein structure and A nonfibrillar collagen consisting of 21] 

function three identical a1 chains 

Binding proteins Anchorin Cll 22] 

Known central function Regulation of endochondral ossification 3,14] 

and supporting properties of the growth 
plate of cartilage and the mineralization 
process 

Animals models with None 

protein affected 

Biomarkers C-CO110 [16] 
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SUMMARY 

Type XI collagen is a fibrillary collagen. Type XI collagen is broadly distributed in artic- 
ular cartilage, testis, trachea, tendons, trabecular bone, skeletal muscle, placenta, lung, 
and the neoepithelium of the brain. Type XI collagen is able to regulate fibrillogenesis 
by maintaining the spacing and diameter of type II collagen fibrils, and a nucleator for 
the fibrillogenesis of collagen types I and II. Mutations in type XI collagen are asso- 
ciated with Stickler syndrome, Marshall syndrome, fibrochondrogenesis, otospondylo- 
megaepiphyseal dysplasia deafness, and Weissenbacher-Zweymiiller syndrome. Type 
XI collagen binds heparin, heparan sulfate, and dermatan sulfate. Currently there are no 
biomarkers for type XI collagen. 


The COLIIAI, COL11A2, and COL11A3 genes encode a chains of type XI 
collagen (COLXTI), a member of the fibrillar collagen subgroup [1,2]. COLXI is a 
minor constituent of type I—containing fibrils and presents an al (XDa2(X Da3 (XT) 
heterotrimer, which is most abundantly found in cartilage. Mutations in the genes 
are associated with Stickler syndrome, Marshall syndrome, fibrochondrogenesis, 
otospondylomegaepiphyseal dysplasia (OSMED) deafness, Weissenbacher— 
Zweymiiller syndrome, and nonsyndromic deafness [3-5]. 

Type XI collagen molecules are primarily crosslinked to each other in car- 
tilage. The crosslinks result in the formation of mature type XI collagen fibers, 
with the help of collagen types II and IX. Type XI collagen is broadly distributed 
in articular cartilage, testis, trachea, tendons, trabecular bone, skeletal muscle, 
placenta, lung, and the neoepithelium of the brain [6,7]. Type XI collagen is 
also a part of the jelly-like substance that fills the eyeball, the inner ear, and the 
center portion of the intervertebral discs. However, it is preferentially retained 
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at the chondrocyte surface and involved in the organization of the pericellular 
matrix via interaction with cartilage proteoglycans [8]. 

Type XI collagen can form uniformly thin fibrils by acting as the basic struc- 
ture for the type II collagen fibril network of developing cartilage. It is also the 
nucleator for the fibrillogenesis of collagen types I and II [9-11]. It is able to 
regulate fibrillogenesis by maintaining the spacing and diameter of type II col- 
lagen fibrils. 

It is reported that type XI collagen plays pathogenic roles in musculoskeletal 
diseases. Deficiencies in this collagen can result in chondrodystrophies in mice 
and humans. The thrombospondin-like domains of collagen XI extend from the 
core microfibrils onto the fibril surface [9]. Retained N-propeptide domains are 
thought to inhibit lateral growth as well as regulate fibril assembly [12]. 

Type XI collagen is resistant to collagenases, but it is hydrolyzed by gelati- 
nases, giving rise to many peptides. Enzymatic degradation of type XI collagen 
was believed to play a significant role in the turnover of articular cartilage in 
both healthy and diseased states [13]. It has been shown that one mutation of 
type XI collagen leads to increased degradation of type II collagen in articular 
cartilage [14]. Immunization of rats with homologous type XI collagen was 
observed to lead to chronic and relapsing arthritis with different genetics and 
joint pathology than arthritis induced by homologous type II collagen [15]. 
Type XI collagen-induced arthritis is believed to be associated with the RT1f 
haplotype, in contrast to type II collagen-induced arthritis, which is associated 
with the RT1la and RT 1u haplotypes [16]. 

In addition, many functional studies have confirmed that type XI collagen 
is distributed in a variety of cancerous tissues including breast, lung, colon, and 
pancreatic tumors. This distribution suggests type XI collagen plays a critical 
role in cancer proliferation, invasion, and metastasis, as well as in resistance to 
therapeutics [17]. Levels of ColXIal have also been shown to be increased, 
compared with normal tissues, in several tumor types including colorectal, pan- 
creatic, breast, and non-small-cell lung cancer [18]. It can be used as an accu- 
rate marker for differential diagnosis of breast carcinoma invasiveness [19,20]. 
Collagen XIa1, with its low expression in normal tissue and high expression in 
cancer, may be an important therapeutic target [21]. 


Type XI Collagen Description References 
Gene name and number COL11A1 (gene ID: 1301) [1,2] 
COL11A2 (gene ID: 1302) 

COL11A3 (ie, COL2A1) 
Mutations with diseases Stickler syndrome [3-5] 
in humans Marshall syndrome 
Fibrochondrogenesis (severe, autosomal- 
recessive, short-limbed skeletal dysplasia) 
OSMED 
Weissenbacher—Zweymiller syndrome 
Nonsyndromic deafness 
Null mutation in mice Homozygous chondrodysplasia (cho/cho) [10] 
mouse 
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Type XI Collagen Description References 
Tissue distribution in Intervertebral discs, articular cartilage, testis, [6,7] 
healthy states trachea, tendons, trabecular bone, skeletal 

muscle, placenta, lung, and neoepithelium of 

the brain 
Tissue distribution in Cancerous tissues [17] 
pathologically affected 
states 
Special domains Thrombospondin-like, noncollagenous (9,12,22] 
Special neoepitopes N.A. 
Protein structure and Heterotrimeric molecules composed of a1(XI), [9,10,23] 
function a2(XI), and «3(X1) subunits 


Forms uniformly thin fibrils; acts as the basic 
structure template for the characteristic type II 
collagen fibril network of developing cartilage; 
nucleator for the fibrillogenesis of collagen 
types | and II; maintains the spacing and 
diameter of type II collagen fibrils; establishes 
and maintains tissue integrity and cohesion 


Binding proteins Heparin, heparan sulfate, and dermatan [11] 
sulfate 
Known central function Regulation of fibrillogenesis, such as by {11] 
controlling the diameter of major collagen 
fibrils 
Animals models with Type XI collagen-induced arthritis in DBA/1 [16,24] 
protein affected mice; type XI collagen-induced chronic 
arthritis in the rat 
Biomarkers Splicing patterns of type XI collagen [19,20] 


(markers for osteochondrogenic tumors); 
type XI collagen a1 (accurate marker in the 
differential diagnosis of breast carcinoma 
invasiveness in core needle biopsies) 


COL, collagen; N.A., not applicable; OSMED, otospondylomegaepiphyseal dysplasia deafness. 
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SUMMARY 

Type XII collagen is a fibril-associated collagen with interrupted triple helices. Type 
XII collagen is a component of human skeletal muscle. Type XII collagen has special 
domains such as the NC3 domain that carry glycosaminoglycan chains, and it interacts 
with matrix proteins such as decorin, cartilage oligomeric matrix protein, fibromodu- 
lin, and tenascin. Coll2a null mice display skeletal abnormalities such as shorter and 
smaller long bones with reduced mechanical strength and altered vertebrae structure. 
Col12a(—/—) osteoblasts are disorganized and less polarized, with disrupted cell-cell 
interactions. Recessive and dominant mutations in COL12A1 cause a novel Ehlers— 
Danlos syndrome/myopathy overlap syndrome in humans and mice. Presently there are 
no biomarkers of type XII collagen. 


Type XII collagen is a fibril-associated collagen with interrupted triple heli- 
ces. It is a component of human skeletal muscle. One of its functions is 
to temporarily stabilize type I collagen fibrils at its collagenous domain 
by preventing the fibrils from permanently crosslinking [1]. By organizing 
collagen fibrils it maintains bone and muscle integrity [2] and can regulate 
osteoblast polarity and communication during bone formation [3]. These 
properties are demonstrated by the mutations in type XII collagen, which 
cause Ehlers—Danlos syndrome associated with skeletal abnormalities and 
muscle weakness in mice and humans [2]. Type XII collagen is a homotri- 
mer assembled from three homolog al chains encoded by the COLXIIA1 
gene. It is composed of the collagenous domain (COLI and COL2), three 
N-terminal noncollagenous domains (NCI—NC3) assembled by disulfide 
bonds, and a short interrupted collagen triple helix near the C terminus [4]. 
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The NC3 domain carrying glycosaminoglycan chains interacts with matrix 
proteins such as decorin, fibromodulin [5,6], and tenascin [7]. Tenascin 
plays a functional role in stabilizing bone and muscle structure, and when 
mutated causes Ehlers—Danlos syndrome. Alternative splicing of the NC3 
domain into a short and a long variant generates two isoforms of type XII 
collagen: type XIIa of 350kDa and a smaller type XIIb of 220kDa that 
lacks glycosaminoglycan chains and resembles type XIV collagen [4]. In 
humans the most abundant isoform is type XIIa collagen [8—10]. The NC1 
domain can be alternatively spliced, generating a different C terminus of 
type XII collagen, and thus up to four isoforms of type XII collagen could 
exist [11]. 

Type XII collagen has functions other than maintaining the integrity of 
bone and muscle. It may indicate hyaline cartilage development by rediffer- 
entiating passaged and primary chondrocytes in the early stages of cartilage 
tissue formation [12]. In addition, Polacek et al. showed type XII collagen 
is present in the secretome of human passaged chondrocytes [13], but it was 
not detected in the secretome of cartilage explant [12]. It may influence cell 
differentiation as well, because knockout mice show delayed endothelial 
cell maturation [14]. 

Another role of type XII collagen may be to stabilize the vascular struc- 
ture and prevent the formation of atherosclerotic lesions. In vitro, human 
umbilical vein endothelial cells exposed to fluid shear stress secrete type XII 
collagen. Supporting data are that when type XII collagen is highly preva- 
lent in the human aortic wall, atherosclerotic lesions are absent, whereas 
collagen XII is only weakly present in the intima of atherosclerotic plaques 
[15]. It can only be speculated as to whether this activity against athero- 
sclerotic lesions is due to the von Willebrand factor A-like domain present 
in type XII collagen. Platelets are major players in the initiation of the ath- 
erogenetic process during which the von Willebrand factor is a well-known 
adhesive mediator between platelets and the vascular wall [16]. Interest- 
ingly, patients with hemophilia or von Willebrand disease are believed to be 
protected from atherosclerosis because of their coagulation defect, although 
this hypothesis remains controversial [17]. 

Finally, it has been speculated that type XII collagen is involved in fibrosis 
and cancer, in particular by aiding cancer cell dissemination. The expression of 
type XII collagen is considerably higher in the highly malignant breast cancer 
cell line MCFIOCA1 compared with a nonmalignant cell line [18]. In vivo, type 
XII collagen has been found in the stroma of breast invasive ductal cancer [19] 
and in the desmoplastic invasive front of colorectal cancer metastasis [20]. It has 
been found as a marker of myofibroblastic differentiation during colorectal can- 
cer metastasis [20]. Expression of type XII collagen is altered by transforming 
growth factor-$, a well-known contributor to breast cancer [21,22]. Bleomycin- 
induced pulmonary fibrosis increases expression of types XII and XIV collagen 
in mice [23]. 


Type XII Collagen 
Gene name and 
number 


Mutations with diseases 
in humans 


Null mutation in mice 


Tissue distribution in 
healthy states 


Tissue distribution in 
pathologically affected 
states 

Special domains 


Special neoepitopes 
Protein structure and 
function 
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Description 

COLXIIA1, chromosome 6q12-q13. 
Transcript variant long, NCBI accession 
no. NM_004370.5. 

Transcript variant short, NCBI accession 
no. NM_080645.2. 

Recessive and dominant mutations in 
COL12A1 cause a novel EDS/myopathy 
overlap syndrome in humans and mice. 
COL12A1 gene polymorphism is 
associated with anterior cruciate ligament 
ruptures in women. 

COL12A null mice display skeletal 
abnormalities. They have shorter and 
smaller long bones with reduced 
mechanical strength and altered 
vertebrae structure. COL12A (-/-) 
osteoblasts have delayed maturation 

and are disorganized and less polarized, 
with disrupted cell-cell interactions; 
decreased connexin43 expression; and 
impaired gap junction function. 
Abnormal corneal endothelial maturation 
in collagens XII and XIV null mice. 
Recessive and dominant mutations in 
COL12A1 cause a novel EDS/myopathy 
overlap syndrome in humans and mice. 
Found in association with type | collagen. 
Both isoforms appear in amnion, chorion, 
skeletal muscle, small intestine, and in 
the cell culture of dermal fibroblasts, 
keratinocytes, and endothelial cells. 
Only the short isoform is found in the 
lung, placenta, kidney and a squamous 
cell carcinoma cell line. 

Only the long isoform is found in the 
corneal epithelium, Bowman membrane, 
and the interfibrillar matrix of the corneal 
stroma. 

In the stroma of IDC breast cancer; in the 
desmoplastic invasive front of colorectal 
cancer metastasis. 

NC1 long and NC3 short splice variants 
of type XII collagen are overexpressed 
during corneal scarring. 

N.A. 

Type XIl is a fibril-associated collagen 
with interrupted triple helices (FACIT) 
collagen, which functions to temporarily 
stabilize type | collagen fibrils as its 
collagenous domain prevents the fibrils 
from permanently crosslinking. 
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Type XII Collagen Description References 

Binding proteins Decorin, fibromodulin, tenascin-X, [5-7,30] 
COMP. 

Known central function Protect bone and muscle integrity by [2] 
organization of collagen fibrils. 

Animals models with N.A. N/A 

protein affected 

Biomarkers N.A. [20] 


COL, Collagen; COMP, cartilage oligomeric matrix protein; EDS, Ehlers-Danlos syndrome; FACIT, 
fibril-associated collagen with interrupted triple helices; /DC, invasive ductal cancer; N.A., not 
applicable; NCBI, National Center for Biotechnology Information. 
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SUMMARY 

Type XIII collagen is a nonfibrillar, type II trans-membrane collagen. It binds to sev- 
eral proteins such as fibronectin, perlecan, nidogen-2, vitronectin, type IV collagen and 
alpi integrin. Type XIM collagen expression is more pronounced during development 
and post-natal growth but decreases toward adulthood and is expressed in cells produc- 
ing connective tissue, with higher expression in certain tumors, in corneal wound heal- 
ing and renal fibrosis. Loss of type XIII collagen is not lethal, but affects maturation of 
both the pre-synaptic and post-synaptic specializations of the neuromuscular junctions. 
Overexpression of the normal type XIII collagen a-chain in cartilage and periosteal osteo- 
blasts results in the development of massive bone overgrowth due to an enhanced osteo- 
blast differentiation capacity. There are currently no biomarkers of type XIII collagen. 


Type XIII collagen was first described in 1987 [1]. It is a nonfibrillar, type 
II trans-membrane collagen. The group of trans-membrane collagens also 
includes collagen types XXIII and XXV [2,3] and XVII, although XVII is dif- 
ferent in structure from the other trans-membrane collagens. Type XIII collagen 
consists of three identical a1 (XIII) chains (COLXTIIA 1), each of which contains 
three collagenous sequences, COL1-3, with sizes of 104, 172, and 235 residues, 
respectively, and noncollagenous domains, NC2-4, with sizes of 34, 22, and 
18 residues [4—7]. The N-terminal noncollagenous domain NC1, encompasses 
a 38-residue cytosolic domain, a 23-residue trans-membrane domain, and the 
first 60 residues of the noncollagenous extracellular sequences adjacent to the 
plasma membrane [4]. The a chains fold, unlike other collagens, into homotri- 
mers in an N- to C-terminal direction [8]. The large ectodomain harbors a furin- 
type endoprotease recognition sequence RRRR! from which the ectodomain 
can be released into the pericellular matrix [5,8]. 
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In humans, 10 exons undergo alternative splicing that seems to occur indepen- 
dently of each other. The alternative splicing primarily affects Coll, Col3, NC2, 
and NC4 [9,10]. The remaining domains, Col2, NC1, and NC3, which are not 
alternatively spliced, are likely to be of high importance in the function of type 
XIII collagen due to their stability. However, their exact function is still unknown. 

Type XII collagen expression is more pronounced during development and 
postnatal growth, but decreases toward adulthood and is expressed in cells pro- 
ducing connective tissue [11-14]. Its expression is induced in certain tumors, 
in corneal wound healing, and in renal fibrosis [15-17]. Studies of the function 
of type XIII collagen suggest it is involved in various biological maturation and 
differentiation processes, some of which are associated with inflammation and 
vasculogenesis. Type XIII collagen seems to exert its effects through ligand 
binding, anchoring, and oligomerization. It binds to several proteins such as 
fibronectin, perlecan, nidogen-2, vitronectin, type IV collagen, and «161 inte- 
grin. It apparently has a further role in regulating inflammation and immunity 
and may provide protection against cancer development. 

Studies in mice imply that the trans-membrane domain of type XIII collagen 
is of functional importance, but is not necessary for all functions of this collagen 
[18]. Studies with Col13a1N’/N mice have revealed a role for the N terminus of 
type XIII collagen in anchoring muscle cells to the basement membrane. These 
mice have an abnormal plasma membrane—basement membrane interface. Lack 
of type XIII collagen is not lethal, but maturation of both the presynaptic and 
postsynaptic specializations of the neuromuscular junctions is compromised 
[14]. Furthermore, overexpression of the normal collagen type XIII a chain in 
cartilage and periosteal osteoblasts results in the development of massive bone 
overgrowth due to an enhanced osteoblast differentiation capacity [13]. In sum- 
mary, type XIII collagen contributes to differentiation of certain cell types, bone 
balance, and maturation of neuromuscular junctions. 


BIOMARKERS OF TYPE XIII COLLAGEN 


Due to the shedding of the ectodomain a neoepitope is released into the extra- 
cellular matrix. This neoepitope is not thereafter released to circulation, but 
upon further degradation of the ectodomain this specific neoepitope could be 
released to circulation and be a biomarker of type XIII collagen. However, as 
the level of type XIII collagen in healthy states is low, this biomarker could be 
useful in cancer, corneal wound healing, and renal fibrosis, as the level of type 
XIII collagen is increased in these pathological states. 

Another potential biomarker could be a measure of autoantibodies to type 
XIII collagen. Antibodies to type XIII collagen have been found to be increased 
in sera from Graves ophthalmopathy patients and have been studied as biomark- 
ers in the disease [19]. An enzyme-linked immunosorbent assay (COLXIIAb) 
for detection of the autoantigen has been made by Wall and colleagues [19]. 
Graves patients with active ophthalmopathy had higher levels of COLXIITAb 
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than Graves patients without ophthalmopathy and controls. This indicates that 
COLXIIAb is a biomarker of inflammation of orbital adipose connective tissue 
[20,21]. In a small study of 10 patients receiving rituximab, no decrease in the 
biomarker was seen upon treatment [22]. 


Type XIII collagen 
Gene name and number 


Mutations with diseases 
in humans 
Null mutation in mice 


Tissue distribution in 
healthy states 


Tissue distribution in 
pathologically affected 
states 


Special domains 


Special neoepitopes 


Protein structure and 
function 


Binding proteins 


Known central function 
Animals models with 
protein affected 
Biomarkers 


Description 

COLXINA1, 1ID:1305 
Location: 10q22, exon 41 
N.A. 


Not lethal, but maturation of both 
the presynaptic and postsynaptic 
specializations of the neuromuscular 
junction is compromised 
Connective tissue—producing cells 
and in focal adhesions. Often in 
blood vessels and junctional 
structures such as neuromuscular 
structures 

During development and postnatal 
growth and is decreasing toward 
adulthood. Increased expression in 
certain tumors, corneal wound 
healing, and renal fibrosis 

aa 1-44: cytoplasmic 

aa45-61: transmembrane. Signal 
anchor for type II collagen 

aa 62-717: topological domain 

aa 1-121: NC1 

aa 122-216: CON 

aa 217-269: NC2 

aa 270-441: Col2 

aa 442-463: NC3 

aa 464-699: Col3 

aa 700-717: NC4 

Shedding occurs at aa 61-62. 
Sequence 62-69: Hfrtaelq 
Transmembrane, nonfibrillar 
collagen 

Many alternative spliced transcript 
variants 

Fibronectin, perlecan, nidogen-2, 
vitronectin, collagen IV, a1ß1 
integrin 

Covalent crosslinking collagen 
Homozygous Col13a1N/N 


Autoantibodies of type XIII collagen 
are used as biomarkers of Graves 
ophthalmopathy 


COL, Collagen; N.A., not applicable; aa, amino acids. 
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SUMMARY 

Collagen is a fibril-associated collagen with interrupted triple helices found mainly in 
skin, tendon, cornea, and articular cartilage. It regulates fibrillogenesis by limiting fibril 
diameter through the prevention of lateral fusion of adjacent fibrils. Type XIV collagen 
is often present in areas of high mechanical stress, indicating it potentially has a role in 
maintaining mechanical tissue. Type XIV collagen has three a1 (XIV) chains composed 
of three noncollagenous (NC1-3) and two collagenous domains (COL1 and COL2) that 
adhere to fibrillar collagen. Loss-of-function mutation in results in punctuated palmoplan- 
tar keratoderma, and mice null for collagen XIV are viable; however, formation of the 
interstitial collagen network is defective in tendons and skin, leading to reduced biome- 
chanical function. There are no biomarkers available for type XIV collagen. 


Type XIV collagen is a fibril-associated collagen with interrupted triple helices 
(FACIT) found mainly in skin, tendon, cornea, and articular cartilage [1—4]. It reg- 
ulates fibrillogenesis by limiting fibril diameter through the prevention of lateral 
fusion of adjacent fibrils [1]. This is unlike another FACIT collagen, type XII col- 
lagen, which stabilizes fibrils during development and remodeling [5]. Type XIV 
collagen has three al (XIV) chains composed of three noncollagenous (NC1-3) 
and two collagenous domains (COL1 and COL2) that adhere to fibrillar collagen 
[6] (Fig. 4, Introduction). The NC3 domain contains modules structurally simi- 
lar to von Willebrand factor A-like domains and fibronectin type HI repeats that 
mediate its functional properties [6]. Alternative splicing of the NC1 domain gen- 
erates two splice variants that are expressed during development and have been 
suggested to mediate the maturation of fibrils [1]. 

Type XIV collagen is often present in areas of high mechanical stress, indi- 
cating it potentially has a role in maintaining mechanical tissue [2,3]. It has been 
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shown to be important for extracellular matrix assembly and tissue function [7]. 
Null mice for type XIV collagen have been shown to be viable, but the forma- 
tion of the interstitial collagen network is defective in tendons and skin, leading 
to reduced biomechanical function. These mice had defects in fibril growth and 
fiber assembly during embryonic development [7]. Type XIV, as well as type 
XII, collagen null mice are also found to have delayed endothelial maturation 
of the cornea. The structural alterations suggest changes in the endothelium 
result in increased corneal thickness [8]. It may be speculated that in the fibrotic 
process type XIV collagen plays a role distinctly different to that of the FACIT 
type XII collagen. Unlike type XII collagen, type XIV collagen has been found 
to be present in the later stages of bleomycin-induced mouse lung fibrosis [8,9]. 


Type XIV Collagen 
Gene name and number 


Mutations with diseases 
in humans 


Null mutation in mice 


Tissue distribution in 
healthy states 


Tissue distribution in 
pathologically affected 
states 

Special domains 


Special neoepitopes 


Protein structure and 
function 


Binding proteins 
Known central function 
Animals models with 


protein affected 


Biomarkers 


Collagen Type 

COL14A1, chromosome 8q23-q24.1 

NCBI accession no.: NM_021110 
Loss-of-function mutation in AAGAB 

in Chinese families with punctuated 
palmoplantar keratoderma. 

Abnormal corneal endothelial maturation in 
collagen types XII and XIV null mice. 

Mice null for collagen XIV are viable; 
however, formation of the interstitial collagen 
network is defective in tendons and skin, 
leading to reduced biomechanical function. 
Studies of type XIV collagen in chicken, 
bovine, and human tissues showed that type 
XIV collagen is prevalent in skin, tendon, 
cornea, and articular cartilage 

Type XIV collagen is observed in the later 
stages of fibrosis 


The two NC1 splice variants expressed during 
development may mediate the maturation of 
fibrils 

Degradation of the COL1 domain of type XIV 
collagen by 92-kDa gelatinase. 

FACIT collagen. It interacts with fibrillar 
collagens to limit fibril diameter by preventing 
lateral fusions of adjacent fibrils 

Possibly decorin and type | collagen 
Regulates fibrillogenesis by limiting fibril 
diameter through the prevention of lateral 
fusion of adjacent fibrils 

In zebrafish, it is transiently expressed in 
epithelia and is required for the proper 
function of certain basement membranes 

NA 


References 
10] 


tl 


8] 


1-4,12] 


Col, Collagen; FACIT, fibril-associated collagen with interrupted triple helices; NA, not applicable; 
NCBI, National Center for Biotechnology Information. 
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SUMMARY 

Collagen type XV is from the multiplexin superfamily of collagens, due to the presence 
of multiple noncollagenous interruptions in their central triple helix. Collagen type XV 
is predominantly located in the basement membrane zones of microvessels and cardiac 
or skeletal myocytes. Type XV collagen is produced mainly by fibroblasts, muscle cells, 
and endothelial cells. To date, there are no known human disease-causing mutations of 
the COLI5A1 gene. However, mice deficient in collagen XV show skeletal myopathy, 
impaired cardiac function, and defects in the microvasculature of the heart and skin. 
The fragment of NC1-XV, restin (also named collagen XV endostatin), has ambiguous 
antiangiogenesis effects, but inhibits endothelial cell migration in vitro. There are 
presently no biomarkers of type XV collagen. 


Collagen type XV is from the multiplexin superfamily of collagens. Collagen 
type XV is predominantly located in the basement membrane zones of microves- 
sels and cardiac or skeletal myocytes. It is also found in kidney and in interstitial 
tissues in the pancreas as well as in testis, ovaries, prostate, small intestine, and 
colon [1]. Type XV collagen is produced mainly by fibroblasts, muscle cells, and 
endothelial cells [2]. To date, there are no known human disease-causing muta- 
tions of the COLI5A1 gene. However mice deficient in type XV collagen show 
skeletal myopathy, impaired cardiac function, and defects in the microvascula- 
ture of the heart and skin [3]. Increased deposition of type XV collagen has been 
observed in fibrotic kidneys and in the sclerotic capillaries of diabetic glomeruli 
[4]. In addition, loss of type XV collagen from the basement membrane of many 
tissues has been observed before metastasis of tumors [5]. 

Type XV collagen shares structural homology with type XVIII collagen. 
Both are assembled into homotrimers and classified under the multiplexin group 
due to the presence of multiple noncollagenous interruptions in their central 
triple helix [6]. Despite both being proteoglycans (PGs), type X VIII collagen is 
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a heparan sulfate PG, whereas type XV collagen has a glycosaminoglycan side 
chain composition that is variable and consists mainly of chondroitin sulfate. 
Although type XVIII collagen is abundant in the liver and present only in small 
amounts in skeletal muscle, collagen XV is absent from the liver and highly 
expressed in skeletal muscle. Type XVIII collagen endostatin derived from the 
NC1-XVIII (noncollagenous C terminal) has been shown to possess [7] antimi- 
gratory and antitumoral properties. In contrast, the corresponding fragment of 
NC1-XV, restin (also named type XV collagen endostatin), has ambiguous 
antiangiogenesis effects, but inhibits endothelial cell migration in vitro [6,8]. 
Restin is less potent as a tumor suppressor than endostatin, has no posttrans- 
lational modifications, and has different binding partners in the extracellular 
matrix [9] from endostatin. Both collagen types XV and XVIII have been shown 
to mediate leukocyte influx in renal ischemia and reperfusion [10]. 


Type XV Collagen Description References 
Gene name and COL15A1 {11] 
number 
Mutations with NA H] 
diseases in humans 
Null mutation in Mild skeletal myopathy with increased [4] 
mice sensitivity to exercise-induced muscle damage. 
Cardiovascular problems such as collapsed 
capillaries and diminished inotropic response 
Tissue distribution in Mainly in basement membrane of microvessels [6,9] 
healthy states or cardiac and skeletal myocytes. Also in kidney, 
pancreas, intestine, prostate 
Tissue distribution Increased deposition in fibrotic kidneys and in the [6,8] 
in pathologically sclerotic capillaries of diabetic glomeruli 
affected states 
Special domains Restin (also known as XV-endostatin), which is [6] 
similar to XVIIl-endostatin 
Special neoepitopes C-terminal (NC1-XV) proteolytic production of [12513] 
restin (homologous to endostatin) 
Protein structure and Proteoglycan. Multiplexin is assembled into [6] 
function homotrimers with multiple noncollagenous 
interruptions in the central triple helix. Variable 
glycosaminoglycan side chain composition 
consisting mainly of chondroitin sulfate. 
Structural homology to collagen type XVIII 
Binding proteins Fibulin-2 and nidogen-2, and about 100-fold less [11] 
to fibulin-1, nidogen-1, laminin-1-nidogen-1 
complex, and perlecan 
Known central Organization, mechanical stability, and integration [1] 
function of the basement membrane to subjacent 
connective tissue 
Animals models with NA [4] 
protein affected 
Biomarkers NA [6,9] 


COL, Collagen; NA, not applicable. 
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SUMMARY 

Type XVI collagen is part of the family of fibril-associated collagens with interrupted 
triple helices that do not form fibrils themselves, but rather associate with fibril-form- 
ing collagens as single molecules. Type XVI collagen is synthesized by various cell 
types including dermal fibroblasts and dendrocytes, keratinocytes, smooth muscle 
cells, and chondrocytes. It is expressed in several tissues such as skin, cartilage, heart, 
intestine, arterial walls, and kidney. Type XVI collagen binds to integrins, types II and 
XI collagen, fibrillin-1, and fibronectin. Its main function is to organize and stabilize 
the extracellular matrix by stabilizing collagen fibrils and focal adhesions, and anchor- 
ing microfibrils to the basement membrane. Furthermore, it mediates intracellular sig- 
naling affecting cell adhesion, proliferation, invasiveness, and the formation of focal 
adhesions. There are currently no mutations reported in man for type XVI collagen, 
and no mouse null mutations have been reported. There are currently no biomarkers 
of type XVI collagen. 


Type XVI collagen is part of the family of fibril-associated collagens with inter- 
rupted triple helices (FACITs) that do not form fibrils themselves, but rather asso- 
ciate with fibril-forming collagens as single molecules [21]. Type XVI collagen 
was identified in 1992 by two groups in parallel by screening human fibroblast 
and placenta cDNA libraries. The gene COL16A1 was localized to band p34-35 
of chromosome 1 [15,23]. The coding sequence is 1604 amino acids long and 
contains 10 collagenous domains of 15—422 residues each and 11 non-collagenous 
(NC) domains, most of which are short (11-39 residues) and cysteine rich [15]. 
The N-terminal NC11 domain, however, consists of 312 residues, is globular, and 
contains a proline/arginine-rich protein (PARP) motif also found in several other 
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collagens and in thrombospondin [15,22]. The COLI6A1 gene encodes the a1 
chain of type XVI collagen, with a molecular mass of approximately 210 kDa, and 
three a1 (XVD chains associate to form homotrimers [7,1 1,22]. Cysteine residues 
primarily located in the NC domains form disulfide bonds that stabilize the trimer 
[11,22]. Multiple kinks introduced by the NC domains provide a highly flexible 
protein structure [11]. 

Type XVI collagen is synthesized by various cell types including der- 
mal fibroblasts and dendrocytes, keratinocytes, smooth muscle cells, and 
chondrocytes. It is expressed in several tissues such as skin, cartilage, 
heart, intestine, arterial walls, and kidney [1,2,7,8,10,13,16]. Until now, 
not much is known about the regulation of gene and protein expression. 
[13][20]a1 (XVI) mRNA levels in human dermal fibroblasts and arterial 
smooth muscle cells were reduced by basic fibroblast growth factor and 
elevated by transforming growth factor (TGF)-f2, whereas the effect on pro- 
tein levels was less pronounced [6,8]. However, stimulation of human coro- 
nary smooth muscle cells with TGF-61 showed no effect on the al(XVI) 
mRNA levels [19]. Type XVI collagen is incorporated into distinct tissue- 
dependent suprastructural structures. In cartilage, type XVI collagen has 
been identified in the territorial matrix associating with thin weakly banded 
collagen fibrils containing types II and XI collagen [10]. The FACIT type 
IX collagen could also be found in these structures, but not in the same 
locations as type XVI collagen, indicating a mutually exclusive pattern. 
Thus, the collagen fibrils may be stabilized by either type XVI or IX col- 
lagen. Interestingly, type XVI collagen does not seem to be associated with 
collagen fibrils in skin. Here, type XVI collagen has been localized to the 
upper papillary dermis in narrow zones near the basement membranes at the 
dermal—epidermal junction, and to blood vessels where it associates with 
fibrillin-1 [2,8,10], the major constituent of beaded microfibrils [18]. This 
association has also been identified in the intestinal wall where type XVI 
collagen is deposited in the submucosa and colocalizes with both fibrillin-1 
and integrin al [16]. A remarkable feature of a FACIT collagen is that it not 
only associates with collagen fibrils but also with non-collagenous proteins 
such as fibrillin-1. The role of type XVI collagen in this context may be 
to stabilize the interactions of microfibrils with other matrix components 
and anchor microfibrils to the basement membrane. Binding of fibronectin 
to recombinant type XVI collagen has been shown [11], but colocalization 
in skin was limited to a restricted area near the basement membrane zone, 
leading the authors to conclude that type XVI was not a component of the 
fibronectin-containing extracellular matrix (ECM) [8]. Type XVI collagen 
can associate with cells via integrins alB1 and «261, thus connecting cells 
with specialized fibrils and contributing to the organization of fibrillar and 
cellularcomponents of the ECM [5]. Furthermore, itinduces integrin-mediated 
signaling resulting in, for example, increased numbers of focal adhesions, 
which leads to better cell spreading [16]. Type XVI collagen has been 
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identified as a component of the dorsal root ganglia ECM in developing 
mice, but it was only found in low levels in adult mice [9,13]. However, its 
expression in adult mice increases around neuronal cell bodies in response 
to nerve injury, suggesting a role in the regeneration process [9]. 

Although type XVI collagen is a minor type of collagen it has been impli- 
cated in several diseases of the ECM. Dermal fibroblasts of patients with 
localized and systemic scleroderma showed type XVI collagen mRNA lev- 
els that were elevated 2.3- and 3.6-fold, respectively, compared with healthy 
controls [2]. Furthermore, deposition of type XVI collagen extended from the 
upper dermis to include the lower dermal matrix in the skin of patients with 
systemic scleroderma, whereas it was only found in the superficial dermis of 
control skin [2]. Epidermolysis bullosa cell lines also showed significantly 
upregulated mRNA levels, further establishing the role of type XVI collagen 
in skin diseases [12]. Type XVI collagen expression is affected in patients 
with Crohn’s disease, which is characterized by chronic inflammation of the 
gastrointestinal tract leading to excessive tissue repair and fibrosis. Intestinal 
subepithelial myofibroblasts (ISEMFs) from the inflamed bowel wall showed 
elevated type XVI gene and protein expression compared with ISEMF from 
noninflamed tissue [16]. The expression of integrin alß1 on the surface of 
ISEMFs originating from inflamed bowel wall was also elevated, resulting in 
increased numbers of focal adhesion contacts with type XVI collagen [16]. 
Thus, type XVI collagen may contribute to sustaining the fibrotic response 
by retaining the collagen-producing ISEMFs in the inflamed tissue. Can- 
cer cells are known to alter the surrounding ECM structure, and evidence 
suggests that this includes type XVI collagen. Overexpression of type XVI 
collagen in the superficial epithelial layers of dysplastic areas of the mucosa, 
but reduced expression in the basement membrane, was seen in patients with 
oral squamous cell carcinoma (OSCC), whereas expression was restricted to 
the basement membrane at the dermal—epidermal junction in normal mucosa 
[17]. Overexpression of type XVI collagen in an OSCC cell line has been 
shown to induce cell proliferation and elevated expression of matrix metallo- 
proteinase (MMP)-9, resulting from the type XVI collagen—mediated activa- 
tion of integrin 61 and kindlin-1 and the subsequent intracellular signaling via 
integrin-linked kinase activation [4,17]. Furthermore, a glioblastoma cell line 
as well as human glioblastoma tissue showed overexpression of type XVI col- 
lagen mRNA and protein compared with normal cells and cortex tissue [20]. 
A type XVI collagen substrate induced increased adhesion of glioblastoma 
cells, whereas cell migration was unchanged [20]. Knockdown of the endog- 
enous protein expression in a glioblastoma cell line reduced glioblastoma cell 
invasiveness and number of focal adhesions as well as integrin activation, but 
it did not affect adhesion capacity or proliferation [3]. However, another study 
using the same cell line showed reduced cell adhesion in response to type XVI 
collagen knockdown [20]. Overexpression of type XVI collagen has also been 
observed in two mouse models of hepatocellular carcinoma [14]. 
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BIOMARKERS OF TYPE XVI COLLAGEN 


As type XVI collagen has been implicated in several diseases the possibility of 
using it as a drug target or biomarker exists. Several cleavage sites and result- 
ing fragments of type XVI collagen have been described, suggesting extensive 
processing of type XVI collagen after secretion [7,11]. Future studies may 
reveal whether any of these fragments are as biologically active as the known 
matrikines released from several other collagens. The induction of MMP-9 
expression described in OSCC [4] may result in further cleavage of type XVI 
collagen, possibly releasing disease-specific protein fragments into the circula- 
tion. The role of type XVI collagen in ECM organization and stability as well as 
cell behavior will surely encourage future investigations into its role in diseases. 


Type XVI Collagen 
Gene name and 
number 

Mutations with 
diseases in humans 
Null mutation in mice 
Tissue distribution in 
healthy states 


Tissue distribution 
in pathologically 
affected states 


Special domains 
Special neoepitopes 


Protein structure and 
function 


Description 
COL16A1 (1p35-p34) 


None identified 


N.A. 

Skin, cartilage, heart, intestine, 
arterial walls, and kidney; fetal brain, 
and skeletal muscle. 


Overexpression in localized and 
systemic scleroderma, extending 
from the papillary dermis to the 
lower dermis. 

Overexpression in the intestinal 
wall in Crohn’s disease. 
Overexpression in cancers 
including glioblastoma, OSCC 
(but reduced in basement 
membrane), and HCC. 
Overexpression in response to 
nerve injury. 


COL1-10, NC1-11, PARP/TSPN 


N-terminal processing results 

in 182- and 78-kDa fragments, 
cleaved at R256-D257 and 
E940-L941, respectively. 
C-terminal and subsequent 
N-terminal processing results 

in 150-, 110-, 50-, and 35-kDa 
fragments. 

FACIT type collagen with kinks 
resulting in flexibility. 

Associated with thin collagen 
fibrils in cartilage territorial matrix. 
Associated with microfibrils near 
basement membranes in the papil- 
lary dermis and intestinal wall 
submucosa. 


References 
NCBI gene ID: 1307 
15,23] 
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Type XVI Collagen Description References 
Binding proteins Integrins a161 and «261, microfibrils, [5,10,11] 


thin weakly banded collagen fibrils 
(containing types II and XI collagen), 
fibrillin-1; possibly fibronectin. 


Known central Organizing and stabilizing ECM by [3,10,16,17,20] 
function stabilizing collagen fibrils and focal 


adhesions, and anchoring microfibrils 
to the basement membrane; 
mediating intracellular signaling 
affecting cell adhesion, proliferation, 
invasiveness, and the formation of 
focal adhesions. 


Animals models with HCC mouse models: PDGFC 14] 
protein affected transgenic and Pten null mice. 
Biomarkers N.A. 


COL, collagen; ECM, extracellular matrix; FACIT, fibril-associated collagens with interrupted triple 
helices; HCC, hepatocellular carcinoma; NCBI, National Center for Biotechnology Information; 
OSCC, oral squamous cell carcinoma; PARP, proline-arginine-rich protein; TSPN, thrombospondin 
N- terminal—like domain. 
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SUMMARY 

Type XVII collagen, also known as 180-kDa bullous pemphigoid antigen, is a type II 
transmembrane collagen. Type XVII collagen is expressed in epithelial hemidesmosomes 
of skin, mucous membrane, and eye and has a series of binding partners. The intracel- 
lular domain binds to {4 integrins, bullous pemphigoid antigen, and plectin, whereas the 
extracellular domain binds to laminin. The putative function of type XVII collagen is to 
stabilize adhesion of epithelial cells to the surrounding extracellular matrix. Mutations in 
type XVII collagen can cause junctional epidermolysis bullosa, with abnormal tooth for- 
mation and epithelial recurrent erosion dystrophy disease. Mouse models display similar 
skin symptoms to those of humans, such as blisters and erosions. Enamel hypoplasia is 
also found in this model, suggesting type XVII collagen also has important roles in teeth 
formation. There are currently no biomarkers for type XVII collagen. 


Type XVII collagen, also known as 180-kDa bullous pemphigoid antigen (BP 180), 
is a type II transmembrane collagen [1]. It is expressed in epithelial hemidesmo- 
somes of skin, mucous membrane, and eye [2]. Type XVII collagen is composed 
of three identical 180-kDa a1 chains [3]. The protein structure includes a globular 
cytoplasmic domain, a transmembrane domain, and an extracellular domain with 
15 collagenous domains separated by 16 noncollagenous domains [4]. The intra- 
cellular domain, transmembrane domain, and extracellular domain include 466, 
23, 1008 amino acids, respectively [1]. Col15 domain, the largest collagenous 
area, is a cell adhesion domain. The noncollagenous NC 16a domain is important 
for the triple-helix folding [5,6]. 

Type XVII collagen has a series of binding partners. The intracellular domain 
binds to $4 integrins, 230-kDa bullous pemphigoid antigen (BP230), and plec- 
tin, whereas the extracellular domain binds to laminin 332 [7-10]. Therefore, it 
is suggested that the function of type XVII collagen is to stabilize adhesion of 
epithelial cells to the surrounding extracellular matrix (ECM) [1]. 
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The extracellular domain of type XVII collagen can be shed from the cell 
surface by A Disintegrin and Metalloproteinases (ie, ADAMs) and tumor necro- 
sis factor-a—converting enzyme [3,11,12], and release a soluble domain that is 
usually referred to as 120-kDa linear immunoglobulin (Ig)A dermatosis antigen 
(LAD-1) [13,14]. However, the precise cleavage site on the collagen is unclear 
since studies indicate a different N terminus for the shedding: Ala°?8 from 
HaCaT cells; Leu>?* from DJM-1 cells; and Leu52*, Gln°?5, and Gly°2° from 
an in vitro primary normal human keratinocyte experiment [15,16]. A study 
showed the main physiological cleavage site in human skin could be between 
Leu”? and Gin5*5 [17]. Besides the 120-kDa fragment, a 97-kDa linear IgA 
dermatosis antigen (97-LAD) from the ectodomain of type XVII collagen is 
also found in the epidermis, with the N terminus starting from amino acid 531 
[18,19]. The effect of shedding type XVII collagen is still unclear, but poten- 
tially, it may release the cell from the surrounding matrix and allow it to perform 
other functions [1]. 

Autoantibodies against type XVII collagen are found in autoimmune bullous 
skin disease. Studies have shown that autoantibodies in serum from some auto- 
immune bullous pemphigoid patients predominantly react with the NC16a or 
COL15 domain of type XVII collagen [5,20—22]. Linear IgA disease is another 
pemphigoid-like illness and characterized as a subepidermal blistering disorder. 
In these patients, IgA autoantibodies preferentially target LAD-1 and 97-LAD 
instead of the full-length type XVII collagen [22]. 

Mutations in type XVII collagen can cause junctional epidermolysis bullosa 
(JEB). Abnormal tooth formation is also found in some JEB patients [2,23—29]. 
In JEB, mutations in type XVII collagen result in decreased levels or absence 
of this protein [30]. In epithelial recurrent erosion dystrophy disease, a novel 
mutation in type XVII collagen has been identified [31]. Type XVII collagen 
functions have been further investigated in the COL17 knockout mouse model. 
The knockout mice show similar skin symptoms to those of humans, such as 
blisters and erosions [32]. Enamel hypoplasia is also found in this model, sug- 
gesting type XVII collagen also has important roles in teeth formation [33]. 


Type XVII collagen Description References 
Gene name and number COL17A1, location 10q24.3 (gene ID: 1308) NCBI 
Mutations with diseases Junctional epidermolysis bullosa; abnormal [2,23-29,31] 
in humans tooth formation; epithelial recurrent erosion 
dystrophy 
Null mutation in mice Blisters and erosions on the skin; enamel [32,33] 
hypoplasia 
Tissue distribution in Expressed in epithelial hemidesmosomes of 2) 
healthy states skin, mucous membrane, and eye 
Tissue distribution in In junctional epidermolysis bullosa, [30] 
pathologically affected mutations in type XVII collagen also result in 
states decreased levels or absence of this collagen. 


Special domains LAD-1, 97-LAD {13,14,18,19] 


Type XVII Collagen Chapter |17 109 


Type XVII collagen Description References 

Special neoepitopes The cleavage sites for LAD-1 and 97-LAD [15,18] 
fragments 

Protein structure and Composed of three identical a1 chains. The [3-6] 

function collagen includes a globular intracellular 


domain, a transmembrane domain, and an 
extracellular domain that is composed of 15 
collagenous domains and 16 noncollagenous 
domains. 

Col15 domain, the largest collagenous 
region, is a cell adhesion domain. The 
noncollagenous NC16a domain is important 
for the triple-helix folding. 


Binding proteins The intracellular domain binds to B4 [7-10] 


integrins, 230-kDa bullous pemphigoid 
antigen (BP230) and plectin, whereas the 
extracellular domain binds to laminin 332. 


Known central function Stable adhesion of epithelial cells to the [1] 
surrounding ECM 

Animal models with Col17 knockout mouse [B233] 

protein affected 

Biomarkers N.A. 


COL, Collagen; NCBI, National Center for Biotechnology Information; LAD, linear immunoglobu- 
lin A dermatosis antigen; BP, bullous pemphigoid antigen; ECM, extracellular matrix; N.A., not 
applicable. 
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Type XVIII collagen possesses features of collagens and proteoglycans and is localized 
in various basement membrane zones. Endostatin, the carboxyl-terminal fragment of type 
XVII collagen, has been found to inhibit angiogenesis and tumor growth. There are three 
isoforms of type XVIII collagen: short, middle, and long. The short isoform is present 
in vascular and epithelial basement membrane structures, and the long isoform is highly 
expressed in the liver. Nonsense and missense mutations of type XVIII collagen lead to 
the autosomal recessive disorder Knobloch syndrome-1, which is characterized by eye 
abnormalities. COL18A1-—/— knockout mice likewise suffer from eye abnormalities and 
manifest broadened basement membrane structures. This suggests that type XVII colla- 
gen not only is involved in eye development but also has an important role in maintaining 
basement membrane integrity. Type XVIII collagen is the prime example of a collagen 
that holds structural properties in the intact form and signaling potential in the degraded 
form. 


Abbreviations 

Col Collagenous domain 

HSGAG Heparan sulphate glycosaminoglycan 
NC Noncollagenous region 

TSP-1 Thrombospondin 1 


Type XVIII collagen possesses features of collagens and proteoglycans and is, 
together with collagen XV, part of the multiplexin family [1]. Multiplexins differ 
from other known collagens by having triple helical domains flanked by unique 
nontriple helical regions, which possibly allows flexibility in the collagen mol- 
ecule [2]. 
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FIGURE 18.1 Schematic illustration of type XVIII collagen. The COL18A1 gene encodes 
three distinct isoforms: short, middle, and long. All three isoforms contain a thrombospondin (TSP) 
1-like domain and 10 triple helical collagenous domains (Col1—10) flanked by 11 noncollagenous 
regions (NC1-11). The NC1 domain contains three functionally distinct subdomains: a trimeriza- 
tion domain, a protease sensitive hinge region, and a C-terminal endostatin domain. The long iso- 
form contains an additional cysteine-rich Frizzled domain. 


Using two different promoters and alternative splicing of exon 3, the 
COL18A1 gene encodes three distinct isoforms: short, middle, and long [3]. 
The long isoform differs from the middle and short isoforms by containing an 
additional cysteine-rich Frizzled domain, which when proteolyzed is able to 
inhibit Wnt/B-catenin signaling by binding to Wnt molecules [4]. 

All three isoforms contain a thrombospondin (TSP) 1-like domain and 10 
triple helical collagenous domains (Col1—10) flanked by 11 noncollagenous 
regions (NC1-11). The NC1 domain, common to all three isoforms, contains 
three functionally distinct subdomains: a trimerization domain, a protease- 
sensitive hinge region, and a C-terminal endostatin domain that has antiangiogenic 
properties [4] (Fig. 18.1). 

Type XVII collagen also harbors three Ser-Gly consensus sites that function 
as attachment sites for heparan sulphate glycosaminoglycan (HSGAG) chains 
[5,6]. Studies show that the HSGAG chains mediate binding with the basement 
membrane [5]. Furthermore, the HSGAG chains have been reported to inter- 
act with L-selectin, an adhesion protein involved in leukocyte migration, and 
with monocyte chemoattractant protein-1, that through binding with type XVIII 
collagen, induces a4/1 integrin activation of monocytes [7]. The HSGAG 
chains also bind the cell adhesion protein receptor protein tyrosine phosphatase 
o that is involved in development of the nervous system [8]. 

All three isoforms of type XVIII collagen are localized in various base- 
ment membrane zones where they interact with laminin, perlecan, nidogen, and 
fibulins [9]. The short isoform is present in vascular and epithelial basement 
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membrane structures, the long isoform is highly expressed in the liver, and the 
intermediate isoform is not widely expressed [10,11]. Type XVIII collagen has 
been shown to be located in almost all structures of the human eye, in fat tissue 
during adipose differentiation, in hair follicles, in articular cartilage, in bone 
marrow, in the kidneys, and in human plasma [12-17]. 

Nonsense and missense mutations of type XVIII collagen lead to the autoso- 
mal recessive disorder Knobloch syndrome-1, which is characterized by eye abnor- 
malities including high myopia, vitreoretinal degeneration, dislocated lenses, retinal 
detachment, macular abnormalities, and occipital encephalocele [18]. Lack of either 
the short or all isoforms of type XVIII collagen causes Knobloch syndrome-1, but 
patients who lack all isoforms present with more severe eye abnormalities [11]. 

COL18A1-/- knockout mice likewise suffer from eye abnormalities includ- 
ing abnormal outgrowth of retina vessels, fragile iris, and accumulation of 
electron-dense deposits [19,20]. Furthermore, COLI8A1-—/— mice manifest 
broadened basement membrane structures in the atrioventricular valves of the 
heart, in kidney tubules, in skin epidermis, and in the choroid plexuses [21]. 
This suggests that type XVIII collagen not only is involved in eye development 
but also has an important role in maintaining basement membrane integrity. 

Increased amounts of type XVIII collagen and endostatin have been 
observed in several pathological conditions. Patients with Alzheimer disease 
show accumulation of the protein in cerebrospinal fluids, amyloid-laden ves- 
sels, and senile plaques of the brain, and increased expression of type XVIII 
collagen has been observed in bullous scleroderma skin [22,23]. Moreover, high 
serum endostatin levels are associated with increased cancer and cardiovascular 
death. Type XVIII collagen levels are overexpressed and associated with a poor 
outcome in patients with non-small-cell lung cancer (NSCLC), colorectal can- 
cer, bladder cancer, and hepatocellular carcinoma [24—27]. 


ROLE OF ENDOSTATIN IN BIOLOGY AND PATHOLOGY 


Endostatin, the carboxyl-terminal fragment of type XVIII collagen, was discov- 
ered in 1997 and described as an inhibitor of angiogenesis and tumor growth 
in murine hemangioendothelioma cells, and has since been intensively stud- 
ied [28]. The hinge region of endostatin contains several proteolytic cleavage 
sites where matrix metalloproteinases (MMPs), cathepsins, and elastases are 
able to cleave off endostatin [29,30]. In its cleaved form, endostatin is able to 
interact with cell membrane receptors and inhibit endothelial cell proliferation 
and migration and stimulate cell death by apoptosis [28,3 1—33]. Through these 
mechanisms, endostatin has been shown to be an effective inhibitor of tumor 
growth and has, in a modified recombinant form, been released for treatment of 
patients with NSCLC in China [34]. 

The exact mechanism of action of endostatin is not completely known. A 
broad range of targets have been investigated, and studies point to multiple 
signaling pathways being involved in mediating the effects of endostatin, as 


116 Biochemistry of Collagens, Laminins and Elastin 


reviewed by Seppinen et al. and Digtyar et al. [6,35]. In brief, endostatin has 
been shown to directly bind the vascular endothelial growth factor (VEGF) 
receptor KDR/FIk-1, thereby interfering with VEGF signaling leading to inhibi- 
tion of proliferation and migration of endothelial cells [32]. Endostatin can also 
promote f-catenin degradation, leading to cell cycle arrest through inhibition of 
the cyclin-D1 promoter [36]. Furthermore, studies show that endostatin is able 
to bind to a5ß1 integrin caveolin-1, leading to a decreased cell migration and 
inhibit the activation or promatrix MMP-2 [37,38]. 

Endostatin has also been reported to be important in several other patholo- 
gies besides cancer. In a mouse ectopic ossification model endostatin was able 
to reduce bone formation [39]. In other mouse models endostatin has also been 
observed to inhibit progression of peritoneal sclerosis and diabetic nephropathy 
[40,41] (Fig. 18.2). 

In cardiac diseases such as coronary heart disease and myocardial infarc- 
tion increased levels of endostatin have been observed [42,43]. Endostatin may 
suppress aberrant tissue remodeling and scarring [42,44,45]. In a study in rats 
with myocardial infarction, neutralization of endostatin worsened the outcomes 
of myocardial infarction [42]. In another study by Yamaguchi et al., a peptide 
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FIGURE 18.2 Multiple roles of endostatin in biology and pathology. Endostatin is able to 
inhibit angiogenesis through induction of endothelial cell death and inhibition of endothelial cell 
proliferation and migration. In mouse models endostatin have been reported to be able to inhibit the 
progression of myocardial infarction, inhibit the progression of diabetic nephropathy, reduce inflam- 
mation, reduce bone formation, inhibit progression of peritoneal sclerosis and prevent dermal fibrosis. 
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from the C-terminal end of endostatin prevented transforming growth factor-p— 
induced dermal fibrosis ex vivo and in vivo [45]. 

Endostatin may also have beneficial effects on the development of inflam- 
matory diseases. In a septic mouse model endostatin increased survival by 
reducing multiple organ dysfunction syndrome, and in rheumatoid arthritis 
mouse model endostatin was able to reduce inflammation and angiogenesis 
[46,47] (Fig. 18.2). 


BIOMARKERS OF ENDOSTATIN 


Endostatin has, in several studies, been shown to have potential as a noninva- 
sive prognostic cancer biomarker, where a high level of endostatin is a marker 
for worse prognosis [24—27]. Endostatin has also been evaluated as a predictive 
biomarker and as an efficacy biomarker of cancer treatment, although to a lesser 
extent [48]. Alba et al. showed that serum endostatin was a predictive factor in 
metastatic breast cancer patients treated with the aromatase inhibitor letrozole 
[49]. Ohlund et al. showed that increased serum endostatin levels in patients with 
pancreatic cancer normalize after surgery or intraperitoneal chemotherapy [50]. 

Together, these results highlight the potential of endostatin as a cancer bio- 
markers. However, no biomarkers and antibodies specific to endostatin cur- 
rently exist. Commercially available antibodies recognize both endostatin still 
attached to type XVIII collagen and the cleaved and active endostatin fragment. 
A biomarker specific to the cleaved endostatin fragment may therefore have 
improved predictive and prognostic value. 


Collagen type XVIII Description References 
Gene name and COL18A1 (21q22.3) RefSeq: NM 
number 130445 
Mutations with Knobloch syndrome [18] 
diseases in humans 

Null mutation in mice ©COL18A1-/— knockout mice suffer from [19-21] 


eye abnormalities and manifest broadened 
basement membrane structures in the 
atrioventricular valves of the heart, in 
kidney tubules, in skin epidermis, and in the 
choroid plexuses 

Tissue distribution in Localized in various basement membrane [12-17] 

healthy states zones and is located in almost all structures 
of the human eye, in fat tissue during adipose 
differentiation, in hair follicles, articular 
cartilage, bone marrow, heart, brain, liver, 
the kidneys, and in human plasma 


Tissue distribution Increased amounts of type XVIII collagen [22-27,51] 
in pathologically and endostatin have been observed in 
affected states patients with Alzheimer disease, traumatic 


brain injury, bullous scleroderma skin, 
NSCLC, colorectal cancer, bladder cancer, 
and hepatocellular carcinoma 
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Collagen type XVIII Description References 

Special domains TSP-1-like domain, Frizzled domain, [3] 
endostatin, and trimerization domain 

Special neoepitopes Endostatin, Fizzled domain [3] 

Protein structure Type XVIII collagen occurs in three [3] 


N-terminal isoforms: short, middle, and 
long. 

Three identical «1 chains assemble into 
left-handed triple helices. All type XVIII 
collagen isoforms contain a TSP-1-like 
domain and 10 triple helical collagenous 
domains (Col1-10) flanked by 11 
noncollagenous regions (NC1-11). The NC1 
domain contains a trimerization domain, 

a protease-sensitive hinge region, and a 
C-terminal endostatin domain. All three 
isoforms also contain heparan sulphate side 
chains, and the long isoform contains a 
cysteine-rich Frizzled domain 


Binding proteins Laminins, perlecan, nidogen/entactin, and [9] 

fibulins 
Known central Maintaining basement membrane integrity, [4,18,21,28] 
functions in man inhibition of angiogenesis, inhibition of 


Wnt/B-catenin signaling and development 
of the eye 


Animals models with NA 
protein affected 
Biomarkers Endostatin can be used as a prognostic [48] 


cancer biomarker, but so far no biomarkers 
specific for the cleaved endostatin fragment 
exist. 


COL, Collagen; Col, collagenous domain; NA, not applicable; NC, noncollagenous region; NSCLC, 
Non-small-cell lung cancer. 
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SUMMARY 

Type XIX collagen is a member of the fibril-associated collagens with interrupted triple 
helices, and it acts as a cross-bridge between fibrils and other extracellular molecules. 
Structurally, type XIX collagen is a homotrimer, composed by three a1 chains. It is 
expressed in vascular, neuronal, mesenchymal, and epithelial basement membrane zones 
in the breast, colon, kidney, liver, placenta, prostate, skeletal muscle, skin, and spleen. 
This collagen has been shown to affect the phenotype for smooth muscle motor dysfunc- 
tion and hypertension sphincter, by dysregulation of extracellular matrix. The functional 
role of type XIX collagen is not well established; however, biomarkers could assist the 
quantification of this protein in both healthy and pathological conditions. 


Type XIX collagen is a member of the fibril-associated collagens with interrupted 
triple helices [1,2] and acts as a cross-bridge between fibrils and other extracel- 
lular matrix molecules. Type XIX collagen is a homotrimer of 400 kDa, composed 
of three al chains. Each of these chains consists of 1142 residues. In each, the 
collagenous region includes five 70—224-residue triple helical subdomains 
(COL1-COLS5S) separated by internal 20-44-amino acid noncollagenous domains 
(NC1-NC6) [1,3]. The gene is localized to 6q12-q14, which is the same region as 
the a([X) and a1(XID) collagen genes [1]. It is a minor collagen first discovered 
in a human rhabdomyosarcoma cell line and now known to be distributed in vas- 
cular, neuronal, mesenchymal, and epithelial basement membrane zones (BMZs) 
in the breast, colon, kidney, liver, placenta, prostate, skeletal muscle, skin, and 
spleen [3-5]. Collagen type XIX is also expressed by a subset of hippocampal 
neurons that are necessary for formation of the hippocampal synapses. However, 
the exact role in the brain is unknown [6]. 
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Characterization of the type XIX collagen gene (Col/9a/) by null mice and 
structural mutations within the gene, has illustrated the contribution of this col- 
lagen to muscle physiology and differentiation of muscle cells [7,8]. The collagen 
has been shown to affect the phenotype for smooth muscle motor dysfunction and 
hypertension sphincter. Muscle dysfunction is a cause of a dysregulated matrix. 
Mice without type XIX collagen exhibit an additional effect known as smooth- 
to-skeletal muscle cell conversion in the abdominal segment of the esophagus [7]. 
Besides its role in muscle composition, type XIX collagen is expressed during 
embryogenesis together with other proteins in extracellular matrix assembly [9]. 
During invasive stages of breast cancer, collagen type XIX disappears from the 
BMZ [10]. The inhibitory effect of collagen type XIX was also demonstrated in a 
murine melanoma model in which the NC1, C-terminal domain of collagen type 
XIX, inhibited tumor cell migration [5]. This effect is furthermore demonstrated 
in vitro by inhibition of migration capacities of tumor cells [11]. 


BIOMARKERS OF TYPE XIX COLLAGEN 


Assays targeting type XIX collagen have already been developed by using quan- 
titative PCR, Western blot, and enzyme-linked immunosorbent assays in tissues 
and body fluids [9]. 


Type XIX collagen Description References 

Gene name and COL19A1, location 6q12q14 NCBI Gene 

number ID: 1310 
[1,2] 


Mutations with N.A. 
diseases in humans 
Null mutation in mice Col19a null mice display muscle physiology and [7] 


hypertension sphincter 


Tissue distribution in Expressed in endothelial, smooth muscle, [3,4,10] 
healthy states neuronal, mesenchymal, and most epithelial 
cells located in the basement membrane zones 
in the breast, colon, kidney, liver, placenta, 
prostate, skeletal muscle, skin, and spleen 
Tissue distribution in Decreased levels of collagen type XIX have E 
pathologically affected been observed in the BMZs in breast cancer at 
states invasive stages 
Special domains N.A. 
Special neoepitopes No special neoepitopes have been discovered [10] 
but the specificity of MMPs and other 
proteinases has not yet been determined 
Protein structure Collagen type XIX is composed by 1142 amino [1,3] 


acids, including a 23-amino acid signal peptide, 
a 268-residue amino-terminal domain, an 
832-residue collagenous region, and a 19-amino 
acid carboxylpeptide. 

The collagenous domain is composed of five 
70-224-residue triple helical subdomains 
(COL1- COLS) separated by internal 20-44- 
amino acid noncollagenous domains (NC1—NC6) 
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Type XIX collagen Description References 
Binding proteins N.A. 

Known central Acts as a cross-bridge between fibrils and other [1] 
functions in man ECM molecules 

Animals models with N.A. 

protein affected 

Biomarkers N.A. 


COL, Collagen; NCBI, National Center for Biotechnology Information; N.A., not applicable; BMZ, 
basement membrane zone; MMP, matrix metalloproteinase; ECM, extracellular matrix. 
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SUMMARY 

Type XX collagen is assigned to the fibril-associated collagens with interrupted triple heli- 
ces collagen subfamily due to its similarities with types XII and XIV collagen. The tissue 
distribution is relatively large, and it is elevated in different types of cancer. Structurally, 
type XX collagen is made up of six fibronectin type III repeat domains, a von Willebrand 
factor A domain, a laminin G-like domain (also known as the thrombospondin domain), 
and two triple helical domains. The function of type XX collagen is relatively unknown, 
and good tools are needed to investigate and quantify the biological role of this collagen. 


Type XX collagen, a 136-kDa secreted protein, became a member of the collagen 
family in 2001 when chick embryo cDNA was isolated and sequenced [1]. Type 
XX collagen is assigned to the fibril-associated collagens with interrupted triple 
helices (FACIT) collagen subfamily due to its similarities with types XII and XIV 
collagen. Structurally, type XX collagen is made up of six fibronectin type II 
repeat (FN) domains, a von Willebrand factor A (vWA) domain, a laminin G-like 
(LGL) domain (also known as the thrombospondin domain), and two triple heli- 
cal domains (Fig. 3, Introduction). 

In addition to the canonical sequence, two other isoforms of human type XX 
collagen produced by alternative splicing have been described [2]. Both of these 
splice variants contain inserted additional amino acids resulting in masses of 
137 and 139kDa, respectively. Posttranslational modifications have been pro- 
posed for type XX collagen. Multiple phosphorylations are found at, or near, 
the first and fourth FN domains [3]. At the fourth FN domain, a glycosylation 
at amino acid position 607 has also been found. An acetylation at amino acid 
position 117 has been described as well. 
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Type XX collagen forms a protein complex consisting of three al chains 
assembled into left-handed triple helices [4]. In proteomic analysis type XX 
collagen has been found to colocalize with collagens and other proteins of the 
extracellular matrix [5]. Based on its similarities to types XII and XIV collagen, 
it is expected to interact with fibrillar collagens through the C-terminal domain 
and project the N-terminal domain away from the fibrils [4]. Thus, type XX col- 
lagen is able to form bridges linking the fibrillar collagens. Due to the presence 
of the vWA domain [6], type XX collagen is also likely to be involved in cellular 
adhesion, migration, differentiation, and signaling. 

Initially, type XX collagen mRNA was found most prevalent in corneal 
epithelium [1]. It was also detectable in embryonic skin, sternal cartilage, and 
tendon, but it was barely detectable in the calvaria, notochord, or neural retina, 
suggesting that it was not expressed in these tissues. However, based on a poly- 
clonal antibody (HPA05 1962) targeting a recombinant protein fragment of type 
XX collagen, most tissues display moderate-to-strong nuclear and/or cytoplas- 
mic collagen type XX protein staining, with only bile ducts, breast, cerebellum, 
smooth muscle cells, and soft tissues staining weakly or negatively with this 
antibody [7]. Using the HPA051962 antibody, several cancer tissues have been 
found to express type XX collagen as well. 


Type XX 

collagen Description References 
Gene name COL20A1 (20q13.33) Gene ID: 
and number 57642 
Isoforms Isoform 1: 1284 aa, 136kDa [2] 

(human) Canonical sequence 


Isoform 2: 1297 aa, 137 kDa 

aa 165-166: PA—PGGSEWRET 

aa 1204-1205: AS>ACESAIQT 

Isoform 3: 1316 aa, 139kDa 

aa 1098-1099: 

R> RGEPGPPGQMGPEGPGGQQGSPGTQGRAVQGPV 
Mutations with N.A. 


diseases in 

humans 

Null mutation N.A. 

in mice 

Tissue Most tissues display moderate-to-strong expression with [7] 


distribution in only bile ducts, breast, cerebellum, smooth muscle 
healthy states cells, and soft tissues showing low or negative presence 


Tissue Several cancer tissues [7] 
distribution in 

pathologically 

affected states 

Special Six FN repeat domains, a WA domain, an LGL domain/ [1,2] 
domains TSP domain, and two triple helical domains 
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Type XX 
collagen Description References 
Special N.A. 
neoepitopes 
Protein Three a1 chains assemble into left-handed triple helices [4] 
structure and 
function 
Binding Fibrillar collagens, similar collagen types XII and XIV [4 
proteins 
Known central Expected to form bridges linking larger fibrillar [4 
function collagens, as do types XII and XIV 
Cellular adhesion, migration, differentiation, and [6] 
signaling 
Animals N.A. 


models with 

protein affected 

Biomarkers of IHC antibody (HPA051962) [7] 
that protein 


COL, Collagen; aa, amino acids; N.A., not applicable; FN, fibronectin type III repeat; vWA, von 
Willebrand factor A; LGL, laminin G-like; TSP, thrombospondin; /HC, immunohistochemistry. 


REFERENCES 

[1] Koch M, Foley JE, Hahn R, Zhou P, Burgeson RE, Gerecke DR, et al. Alpha 1(Xx) collagen, 
a new member of the collagen subfamily, fibril-associated collagens with interrupted triple 
helices. J Biol Chem 2001;276:23 120-6. 

[2] Magrane M, Consortium U. UniProt Knowledgebase: a hub of integrated protein data. Data- 
base (Oxford) 2011;2011:bar009. 

[3] Hornbeck PV, Zhang B, Murray B, Kornhauser JM, Latham V, Skrzypek E. PhosphoSitePlus, 
2014: mutations, PTMs and recalibrations. Nucleic Acids Res 2015;43:D512-20. 

[4] Ricard-Blum S. The collagen family. Cold Spring Harb Perspect Biol 2011;3:a004978. 

[5] Barallobre-Barreiro J, Didangelos A, Schoendube FA, Drozdov I, Yin X, Fernandez-Caggiano 
M, Willeit P, Puntmann VO, Aldama-Lopez G, Shah AM, et al. Proteomics analysis of cardiac 
extracellular matrix remodeling in a porcine model of ischemia/reperfusion injury. Circulation 
2012;125:789-802. 

[6] Colombatti A, Bonaldo P, Doliana R. Type A modules: interacting domains found in several 
non-fibrillar collagens and in other extracellular matrix proteins. Matrix 1993;13:297-306. 

[7] Uhlen M, Bjorling E, Agaton C, Szigyarto CA, Amini B, Andersen E, Andersson AC, Angeli- 


dou P, Asplund A, Asplund C, et al. A human protein atlas for normal and cancer tissues based 
on antibody proteomics. Mol Cell Proteomics 2005;4:1920-32. 


Chapter 21 


Type XXI Collagen 


S.N. Kehlet, M.A. Karsdal 


Nordic Bioscience, Herlev, Denmark 


Chapter Outline 
Biomarkers of Type XXI Collagen 132 
References 133 


SUMMARY 

Type XXI collagen belongs to the fibril-associated collagens with interrupted triple heli- 
ces (FACIT) family of collagen proteins. Type XXI collagen may copolymerize with 
fibrillar collagens via their C-terminal collagenous domains and mediate protein-protein 
interactions through their N-terminal noncollagenous domains, thereby serving as 
molecular bridges in the extracellular matrix. Type XXI collagen is expressed in heart, 
placenta, stomach, jejunum, skeletal muscle, kidney, lung, pancreas, and lymph node. 
Currently, there are no known mutations in humans and mice null mutation work has 
not been reported. The biological function of type XXI collagen remains to be revealed; 
however, data suggest that type XXI collagen may play a role in blood vessel assembly. 
Biomarkers need to be developed to determine the function and relevance under patho- 
logical conditions. 


Type XXI collagen belongs to the fibril-associated collagens with interrupted 
triple helices (FACIT) family of collagen proteins [1]. These collagens consist of 
two or more collagenous domains that are interrupted by several noncollagenous 
domains. They often copolymerize with fibrillar collagens via their C-terminal 
collagenous domains and mediate protein—protein interactions through their 
N-terminal noncollagenous domains, thereby serving as molecular bridges in the 
extracellular matrix (ECM) [2,3]. 

The COL2/A/ open reading frame encodes a 957-amino acid protein with 
a predicted molecular mass of 99kDa. The protein possesses a 22-amino acid 
signal peptide followed by a von Willebrand factor A domain participating in 
protein-protein interactions and a thrombospondin domain. The C-terminal 
consists of two collagen triple helical domains interrupted by two short non- 
collagenous domains [1] (Fig. 3, Introduction). Type XXI collagen mRNA has 
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been shown to be present in ECM-rich tissue, in particular the heart, placenta, 
stomach, jejunum, skeletal muscle, kidney, lung, pancreas, and lymph node [1]. 
These tissues are also enriched with type I collagen. Two other members of 
the FACIT collagen family, type XII and XIV collagens, have been shown to 
colocalize with type I collagen. Type XXI collagen may also colocalize with 
type I collagen, suggesting a role of maintaining the integrity of the ECM [1]. 
However, this hypothesis needs to be investigated further. 

The protein is an extracellular component of blood vessel walls and is 
secreted by vascular smooth muscle cells. It has been speculated that type XXI 
collagen may have a role in blood vessel assembly since its expression is regu- 
lated by platelet-derived growth factor, which induces proliferation and migra- 
tion of smooth muscle cells [4]. 


BIOMARKERS OF TYPE XXI COLLAGEN 


The ECM can prevent drugs from penetrating into cancer cells [5—7]. A study 
has shown that COL2/A1 is significantly upregulated in chemoresistant variants 
of an ovarian cancer cell line [8]. This upregulation indicates that type XXI col- 
lagen may be used as a biomarker in ovarian cancer. 


Type XXI collagen Description References 
Gene name and number COL21A1 NCBI gene ID: 81578 
Mutations with diseases in N.A. 
humans 
Null mutation in mice N.A. 
Tissue distribution in healthy Heart, placenta, stomach, [1] 
states jejunum, skeletal muscle, 
idney, lung, pancreas, and 

lymph node 
Tissue distribution in N.A. 
pathologically affected states 
Special domains vWA and TSP [1] 
Special neoepitopes N.A. 
Protein structure and function Consists of a 22-aa signal [1] 


peptide, two collagenous 
domains interrupted by three 
noncollagenous domains. 

Binding proteins N.A. 

Known central function The function remains [4] 
unknown, but could be 
implicated in blood vessel 


assembly 
Animals models with protein N.A. 
affected 
Biomarkers N.A. 


aa, amino acids; COL, Collagen; NCBI, National Center for Biotechnology Information; N.A., not 
applicable; vWA, von Willebrand factor A; TSP, thrombospondin. 
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SUMMARY 

Type XXII collagen is a member of the fibril-associated collagen with interrupted triple 
helices collagen family. The N terminus has a von Willebrand factor A domain and a 
thrombospondin domain, both containing N-glycosylation sites. Type XXII collagen is 
predominantly found in the heart and skeletal muscle of human tissue and to a lesser extent 
in cartilage and skin in mice. It has been identified in specific tissue junctions, namely, the 
myotendinous junctions (MTJs) that represent the major site of muscle force transmission 
to tendons. Type XXII collagen binds to integrins, especially 0261 and a1161, at MTJs, 
indicating a linking and mechanical stability function of collagen XXII at these points and 
a role as a cell adhesion ligand. Null mutation results in a muscular dystrophy-like phe- 
notype by disrupting the MTJs. Lastly, the expression level has been shown to be elevated 
in certain cancer types and associated with kidney function. Currently there are no known 
biomarkers that may assist in clarifying the functional role of type XXII collagen. 


Type XXII collagen is a member of the fibril-associated collagen with inter- 
rupted triple helices (FACIT) collagen family. COL22A/ has an open read- 
ing frame of 1626 amino acids, including a 27-amino acid signal peptide. The 
protein consists of six collagenous domains interrupted by several noncollag- 
enous domains (Fig. 3, Introduction). The N terminus has a von Willebrand 
factor A (vWA) domain and a thrombospondin (TSP) domain, both containing 
N-glycosylation sites. Type XXII collagen orthologs have been identified in 
mouse, zebrafish, and puffer fish [1,2]. 

mRNA analyses have shown that type XXII collagen is predominantly found 
in the heart and skeletal muscle of human tissue and to a lesser extent in carti- 
lage and skin in mice. It has been identified in specific tissue junctions, namely, 
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the myotendinous junctions (MTJs) that represent the major site of muscle force 
transmission to tendons. Muscle cells close to MTJs express type XXII collagen 
that is deposited in the basement membrane zones of MTJs. Electron micros- 
copy studies have shown that type XXII collagen, unlike other FACIT collagens, 
does not directly polymerize with fibrillar collagens, but rather associates with 
components of microfibrils such as fibrillins [1]. The vWA domain may medi- 
ate binding of type XXII collagen to components of the basement membrane. It 
has been shown that type XXII collagen binds to integrins, especially «261 and 
al1B1, at MTJs, indicating a linking and mechanical stability function of type 
XXII collagen at these points and a role as a cell adhesion ligand [3]. 

Knockdown of COL22A/ in zebrafish results in a muscular dystrophy— 
like phenotype. It is accompanied by a highly significant reduction of muscle 
force caused by a lower level of force transmission capacity together with fiber 
detachment, all contributing to a reduced number of contracting fibers [4]. This 
suggests that type XXII collagen functions as a stabilizer and contributes to 
the maintenance of muscle attachments and efficient transmission of contrac- 
tile forces. The same study also showed that the dystrophic phenotype could 
be reversed by microinjection of human recombinant type XXII collagen pro- 
tein, indicating that COL22A/ is a candidate gene for muscular dystrophies in 
humans. 


BIOMARKERS OF TYPE XXII COLLAGEN 


Preliminary studies suggest that type XXII collagen could have potential as a 
valuable prognostic biomarker in cancer. MRNA levels have been shown to be 
significantly higher in head and neck cancer tissue compared to normal tissue 
from the same individual. The increased mRNA level correlates with lymph 
node metastasis and pathological stage [5]. Furthermore, a proteomic study has 
shown that type XXII collagen is a tumor-specific protein characteristic of the 
colon tumor matrisome [6]. 

An association between COL22A/ and the level of serum creatinine, the 
most important biomarker for assessment of kidney function, has been detected 
in a meta-analysis of genome-wide data. This could indicate a biological rela- 
tionship between muscle mass formation and creatinine levels [7]. 


Type XXII Collagen Description References 
Gene name and COL22A1 NCBI gene ID: 
number 169044 
Mutations with NA 
diseases in humans 
Null mutation in Knockdown of COL22A1 in zebrafish [4] 
zebrafish results in a muscular dystrophy-like 

phenotype by disrupting the myotendinous 

junction 
Tissue distribution in | Myotendinous junctions primarily of the [1] 


healthy states heart and skeletal muscle 
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Type XXII Collagen Description References 
Tissue distribution NA 

in pathologically 

affected states 


Special domains vWA and TSP {1] 

Special neoepitopes NA 

Protein structure and Comprises a 27-aa signal peptide, six [2] 

function collagenous domains interrupted by several 
noncollagenous domains 

Binding proteins Fibrillins, integrins (22681 and a11ß1) [1,3] 

Known central The function remains unknown, but may [1,3] 

function contribute to the mechanical stability of 
myotendinous junctions 

Animals models with Zebrafish [4] 

protein affected 

Biomarkers NA 


aa, amino acids; COL, Collagen; NA, not applicable; NCBI, National Center for Biotechnology 
Information; TSP, thrombospondin; vWA, von Willebrand factor A. 
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SUMMARY 

Type XXIII collagen is a type II transmembrane collagen. It is found in healthy human 
and mouse tissues of the lung, cornea, skin, tendon, and amnion and to a lesser extent in 
the kidney and placenta. Currently, its function is unknown; however, type XXIII collagen 
is acomponent of many epithelia and is expressed on epithelial cell surfaces, suggesting a 
role in the formation or maintenance of cell-cell contacts or in the polarization of epithe- 
lial cells. Levels of type XXIII collagen in patient samples have been proposed as a new 
biomarker of prostate cancer progression and recurrence. Type XXIII collagen has been 
detected in urine samples of prostate and non-small-cell lung cancer patients, and it is 
able to discriminate between controls and cancer patients, indicating a potential role as a 
new diagnostic tool in certain types of cancer. Currently there are no mutations in humans 
or mice associated with biological functions, and no biomarkers have been put forward. 


Type XXIII collagen is a type II transmembrane collagen consisting of an 
N-terminal noncollagenous domain containing a short cytoplasmic region and 
a transmembrane stretch followed by extracellular collagenous domains inter- 
rupted by noncollagenous sequences [1] (Fig. 3, Introduction). It is found in 
healthy human and mouse tissues of the lung, cornea, skin, tendon, and amnion 
and to a lesser extent in the kidney and placenta [2]. Currently the function of 
type XXII collagen is unknown. Immunofluorescence analysis using mouse anti- 
bodies has revealed that type XXIII collagen is a component of many epithelia 
and is expressed on epithelial cell surfaces, suggesting a role in the formation or 
maintenance of cell-cell contacts or in the polarization of epithelial cells. The 
proteins can either exist as full-length molecules anchored to a cell surface or as 
shed soluble molecules. Type XXIII collagen is cleaved from the cell surface by 
furin-mediated proteolysis and releases a 60-kDa soluble domain that forms a 
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multimeric complex [1,2]. At the cell surface of basal keratinocytes, type XXIII 
collagen directly interacts with integrin «261 and induces adhesion and spreading 
of keratinocytes [3]. 

Immunohistochemical analysis has revealed that type XXIII collagen is 
expressed in a variety of carcinoma tissues including prostate, breast, kidney, 
lung, and liver tumors [4]. The protein expression is enriched in the highly met- 
astatic rat prostate carcinoma cell line AT6.1 compared with nonmetastatic cell 
lines [1]. Levels of type XXII collagen in patient samples have been proposed 
as a new biomarker of prostate cancer progression and recurrence [5]. A study 
investigating the potential of type XXIII collagen as a novel lung cancer bio- 
marker also showed that high expression of type XXIII collagen correlates with 
shorter recurrence-free survival times in non-small-cell lung cancer (NSCLC) 
patients [4]. This correlation could be explained by the fact that type XXIII col- 
lagen facilitates the formation of pulmonary metastases in a mouse metastasis 
model by mediating initial adhesion to the epithelium, promoting cancer cell 
aggregation within blood vessels, or both [6]. 


BIOMARKERS OF TYPE XXIII COLLAGEN 


The furin-mediated cleavage of type XXIII collagen may be responsible for the 
release of protein into biological fluids such as urine and serum. Biomarkers of 
serum and urine origin have the advantages of being easy to collect, noninva- 
sive, and low cost and they could be used over time to track the course of the 
disease. Type XXIII collagen has been detected in urine samples of prostate and 
NSCLC cancer patients and is able to discriminate between controls and cancer 
patients [4,5], indicating a potential role as a new diagnostic tool in certain types 
of cancer. 


Type XXIII Collagen Description References 
Gene name and number COL23A1 NCBI gene 
ID: 91522 
Mutations with diseases NA 
in humans 
Null mutation in mice NA 
Tissue distribution in Lung, cornea, skin, tendon, and amnion and [2] 
healthy states to a lesser extent kidney and placenta 
Tissue distribution in NA 
pathologically affected 
states 
Special domains NA 
Special neoepitopes Collagen XXIII can be cleaved by furin from [1] 
the cell surface, generating an extracellular 
soluble molecule 
Protein structure and Consists of an N-terminal cytoplasmic [1] 
function domain, a transmembrane domain, 


and extracellular triple helical domains 
interrupted by short noncollagenous domains 
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Type XXIII Collagen Description References 
Binding proteins Integrin a2ß1 [3] 
Known central function The function remains unknown, but may be [2] 


implicated in the formation or maintenance 
of cell-cell contacts or in the polarization of 
epithelial cells 


Animals models with NA 
protein affected 
Biomarkers NA 


COL, Collagen; NA, not applicable; NCBI, National Center for Biotechnology Information. 
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SUMMARY 

Type XXIV collagen is a fibrillar collagen containing two collagenous domains and 
three noncollagenous domains. The amino-terminal peptide contains a thrombospondin 
N-terminal—like motif and a highly charged segment interspersed with several tyrosine 
residues. Type XXIV collagen is predominately expressed in the formation of bone; how- 
ever, it has recently been found in the brain, muscle, kidney, spleen, liver, lung, testis, and 
ovary, but to a far lesser extent than in bone. COL24A/ is increased during squamous cell 
carcinoma of the head and neck and could be a marker of osteoblast differentiation and 
certain types of cancers. Currently there are no mutations in humans or mice associated 
with biological functions, and no biomarkers have been put forward. 


Type XXIV collagen, encoded by the gene COL24A/, is a fibrillar collagen, 
closely related to types V and XI collagen [1]. Type XXIV collagen is composed 
of 1714 residues, with two collagenous domains (COL1 and COL2) and three 
noncollagenous domains (NCI—NC3), flanked by propeptide-like sequences 
[1-4]. The amino-terminal peptide contains a thrombospondin (TSP) N-terminal 
domain and a highly charged segment interspersed with several tyrosine residues 
(Fig. 2, Introduction) [1]. The major triple helix is slightly shorter than other fibril- 
lar collagens (991-997 residues) and contains two glycine substitutions and one 
G-X-Y imperfection [5,6]. The biological relevance of this imperfection is not 
yet revealed; however, interruptions in the major triple helical domain of classical 
fibrillar collagens are known to be a cause of disease [6,7]. The gene is localized 
to 1p.22.3, and structural considerations and tissue localization suggest a homotri- 
meric association of type XXIV collagen [1,6]. 

Type XXIV collagen is predominately expressed in the formation of the 
mouse embryo as well as in the bone and periosteum of a newborn mouse [2,8]. 
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It is represented together with types I and V collagen in bone, and is known 
as a marker of osteoblast differentiation and bone formation in mice [8,9]. In 
humans it was thought to be restricted to bone and cornea together with types 
I and V collagen [1,7]. However, it has also been found in the brain, muscle, 
kidney, spleen, liver, lung, testis, and ovary, but to a far lesser extent than in 
bone [2,3]. 

The biological role of collagen type XXIV is not revealed [10]. Neverthe- 
less, Col24al mRNA expression profiles have shown that COL24A1/ is increased 
during squamous cell carcinoma of the head and neck (HNSCC). This collagen 
could therefore be a useful marker of not only osteoblast differentiation but also 
prognosis of HNSCC and possibly identify patients at high risk for developing 
HNSCC [10]. 


Type XXIV Collagen Description References 
Gene name and number COL24A1, location 1p22.3 NCBI gene ID: 
255631 [11] 
Mutations with diseases NA 
in humans 
Null mutation in mice NA 
Tissue distribution in Predominantly in bone, but to a far (2,3,8] 
healthy states lesser extent in the brain, muscle, 
kidney, spleen, liver, lung testis, and 
ovary 
Tissue distribution in COL24A1 is increased during [10] 
pathologically affected HNSCC, but the tissue distribution 
states is not fully investigated 
Special domains TSP N-terminal-like domain [1] 
Special neoepitopes NA 
Protein structure and Collagen type XXIV is a fibrillar [1-3] 
function collagen closely related to types 
V and XI collagen, containing two 
collagenous domains and three 
noncollagenous domains. 
The amino-terminal peptide 
contains a TSP N-terminal-like 
motif and a highly charged segment 
interspersed with several tyrosine 
residues. 
Binding proteins NA 
Known central function This collagen is known as a marker [8] 
of osteoblast differentiation and bone 
formation. 
Animals models with NA 
protein affected 
Biomarkers NA 


COL, collagen; HNSCC, squamous cell carcinoma of the head and neck; NA, not applicable; 
NCBI, national center for biotechnology information; TSP, thrombospondin. 
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SUMMARY 

Type XXV collagen belongs to the collagen subgroup termed the membrane collagens. Type 
XXV collagen is primarily expressed in the neurons of the brain and to a lesser extent in the 
heart, testis, and eye in both humans and mice. Furin convertase is able to release a soluble 
collagen-like amyloidogenic component (CLAC) by proteolytic cleavage of COL25A1. 
CLAC has been isolated from Alzheimer-diseased brains and identified as a component 
of senile plaques, which are highly associated with Alzheimer disease. Genetic evidence 
between the COL25A/ gene and risk for Alzheimer disease has furthermore been estab- 
lished. COL25A/ transgenic knockout mice died instantly after birth due to respiratory fail- 
ure. Further observations showed that muscle motor axons failed to elongate and branch, 
followed by axon degeneration. These events led to excessive apoptosis, which suggests type 
XXV collagen is an important molecule. Currently no biomarkers have been put forward. 


Type XXV collagen, coded by the gene COL25A/, belongs to the collagen 
subgroup termed the membrane collagens, which also includes collagen types 
XIU, XVI, and XXII [1,2].Type XXV collagen is primarily expressed in the 
neurons of the brain and to a lesser extent in the heart, testis, and eye in both 
humans and mice [3,4]. 

The structure of type XXV collagen comprises an N-terminal intracellular 
noncollagenous domain (NC1), a transmembrane domain, and three extracel- 
lular collagenous domains (COL1-3), which are separated by four noncollag- 
enous domains (NC2-4). The configuration of mature type XXV collagen is a 
triple-helix structure that is stabilized by the collagen-like domains containing 
repetitive (Gly-X-Y) motifs [1,3,5,6]. 

Furin convertase is able to release a soluble collagen-like amyloidogenic com- 
ponent (CLAC) by proteolytic cleavage of COL25A1 [3]. CLAC has been isolated 
from Alzheimer-diseased brains and identified as a component of senile plaques, 
which are highly associated with Alzheimer disease [3,5]. CLAC binds to fibrilized 
amyloid B-peptides, resulting in the formation of partly protease-resistant aggregates 
[6]. Accumulation of these amyloid {-peptides leads to senile plaque formation. The 
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exact location of the amyloid f-peptide binding motif in CLAC has been identified 
as the LIKRRLIK sequence located within the NC2 domain [6]. 

A study in transgenic mice showed that overexpression of type XXV colla- 
gen was associated with increased expression of (-site amyloid precursor protein 
(APP)-cleaving enzyme 1 (BACE-1), which cleaves APP into amyloid f-peptide. 
Furthermore, accumulation of amyloid B-peptides was observed in the brains of 
mice, correlating with the increased levels of BACE-1, which indicates that type 
XXV collagen promotes amyloid plaque formation and contributes to the develop- 
ment of Alzheimer disease [7]. Genetic evidence between the COL25A/ gene and 
risk for Alzheimer disease has further been established [8]. Other diseases that are 
associated with mutations in the COL25A/ gene are antisocial personality disorder 
and congenital cranial dysinnervation disorder [3,4,8,9]. Hb9-GFP COL25A/ trans- 
genic knockout mice lacking the furin cleavage site died instantly after birth due to 
respiratory failure. Further observations showed that muscle motor axons failed to 
elongate and branch, followed by axon degeneration. These events led to excessive 
apoptosis, indicating the high importance of CLAC [10]. 


Type XXV collagen Description References 
Gene name and COL25A1 NCBI Gene ID: 
number 84570 
Mutations with An SNP in the COL25A1 gene is [3,4,8,9] 
diseases in humans significantly associated with ASPD. Three 


recessive mutations in type XXV have 
been identified to be a cause of CCDD. 
Type XXV collagen is expressed in the 
brains of patients with Alzheimer disease 
where the collagen binds to the fibrilized 
amyloid B-peptides. Genetic evidence 
indicates an association between the 
COL25A1 gene and risk of developing 
Alzheimer disease. 

Null mutation in mice Hb9-GFP Col25a1 transgenic knockout [10] 
mice have been generated by deleting 
exon 2 of the murine Co/25a7 gene, which 
contains the furin cleavage site. The mice 
died instantly after birth due to respiratory 
failure. Further observations showed that 
muscle motor axons failed to elongate and 
branch, followed by axon degeneration. 
These events led to excessive apoptosis. 

Tissue distribution in The expression of type XXV collagen [3,8] 

healthy states is highly brain specific, being present 
in the neurons of cerebral neocortices, 
hippocampus and other subcortical 
nuclei. Weak expression in the heart, 
testis and eye in both humans and mice. 
Type XXV collagen has also been located 
in conjunctival fibroblasts. 
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Type XXV collagen Description References 
Tissue distribution in CLAC is distributed in Alzheimer-diseased [3] 
pathologically affected brains and has been identified as an 
states amyloid plaque component. 
Special domains Proteolytic cleavage of full type XXV (3,5,6,11] 
collagen by furin convertase between the 
107-KIRIAR sequence and the 117-EAPSE 
sequence releases CLAC, which contains 
the extracellular collagen domains that 
are associated with senile plaques in 
Alzheimer-diseased brains. The amyloid 
B-peptide binding motif in CLAC has 
been located in the IKRRLIK sequence 
within the NC2 domain. 


Special potential The furin cleavage site of CLAC is a [3,6] 
neoepitopes potential neoepitope. 

Protein structure and Type XXV collagen is a 654-aa, brain- ([3,5-7,10,12] 
function specific type II transmembrane protein. 


The structure comprises an intracellular 
N-terminal noncollagenous domain 
(NC1), a transmembrane region and 
three repetitive extracellular Gly-X-Y 
collagen-like domains (COL1-3) with 

X and Y being frequently proline and 
hydroxyproline residues. The COL 
domains are interrupted by three 
additional noncollagenous domains 
(NC2-4). The furin convertase cleavage 
site is located between Arg-112 and Glu- 
113, which is able to release the soluble 
CLAC fragment. 

The membrane collagens are in 
general involved in cell-cell and 
cell-matrix adhesion. CLAC binds 
specifically to amyloid B-peptides and 
facilitates their assembly into protease 
resistant aggregates, thus playing a 
role in the development of Alzheimer 
disease. 

Binding proteins Secreted and membrane-bound forms of [3] 
type XXV collagen bind specifically to the 
fibrilized forms of amyloid B-peptide. 

Known central function Type XXV collagen is involved in [8,11] 
fibrillization and cell toxicity and protects 
against proteolysis of amyloid B-peptides 
in Alzheimer-diseased brains. 

Animal models with Hb9-GFP Col25a1 transgenic knockout [7,10] 


protein affected mice. Overexpressed COL25A 1 
transgenic mice. 
Biomarkers N.A. 


COL, Collagen; NCBI, National Center for Biotechnology Information; SNP, single nucleotide 
polymorphism; ASPD, antisocial personality disorder; CCDD, congenital cranial dysinnervation 
disorder; CLAC, collagen-like amyloidogenic component; aa, amino acid; N.A., not applicable. 
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SUMMARY 

Type XXVI collagen contains collagenous domains in its structure, but does not prop- 
erly fit within any of the collagen family subgroups. Trimers of type XXVI collagen are 
formed from the intermolecular disulfide bonds between the NC1 regions. Such trimeriza- 
tion formation is a special characteristic of type XXVI collagen compared with fibrillar 
collagens, where trimer formation is associated with the C-propeptide region. Type XX VI 
collagen is expressed in the testis and ovary of adult tissues. The highest levels are found 
in the reproductive tissues of neonates. Consequently, type XX VI collagen has been sug- 
gested to be associated with generation and modeling of tissues. The research on type 
XXVI collagen is only now emerging, and no biomarkers or mutations associated with 
this protein have been discovered. 


Type XXVI collagen, encoded by the gene COL26A1/, is a 441-amino acid protein 
encoded by the emilin/multimerin domain containing protein 2 on chromosome 
7q22.1 [1]. Type XXVI collagen contains collagenous domains in its structure, 
but does not properly fit within any of the collagen family subgroups. 

The structure of type XXVI collagen comprises two collagenous domains 
(COLI and COL2) consisting of Gly-X-Y repeats and three noncollagenous 
domains (NCI-3). It has been shown that type XXVI collagen forms trimers, 
accumulates in the extracellular matrix of mouse tissues and is secreted in vitro [2]. 

The NC1 domain contains a putative endoplasmic reticulum-targeting signal 
sequence and an emilin domain, which consists of 13 cysteine residues [2,3]. Type 
XXVI collagen trimers are formed from the intermolecular disulfide bonds between 
the NC1 regions. Such trimerization formation is a special characteristic of type 
XXVI collagen compared with fibrillar collagens, where trimer formation is associ- 
ated with the C-propeptide region [2]. NC1 contains a furin protease cleavage site 
(RRRR) that is also present in the transmembrane collagens such as types XIII and 
XXV collagen. The cleavage site is located just before the COL! domain, but it does 
not undergo furin cleavage as do the transmembrane proteins. 
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Type XXVI collagen is expressed in the testis and ovary of adult tissues. The 
highest levels are found in the reproductive tissues of neonates. It has been shown 
that during early testis and ovary development, type XXVI collagen is highly 
expressed in myoid cells and pretheca cells in the testis and ovary, respectively, 
causing accumulation of type XXVI collagen in these tissues [2]. The functional 
role of type XXVI collagen has therefore been suggested to be in the early devel- 


opment of testis and ovary as an extracellular matrix component [2]. 


Type XXVI collagen 


Gene name and number 


Mutations with diseases 
in humans 

Null mutation in mice 
Tissue distribution in 
healthy states 


Tissue distribution in 
pathological affected 
states 

Special domains 


Special neoepitopes 
Protein structure and 
function 


Binding proteins 


Known central function 


Animal models with 
protein affected 
Biomarkers 


Description 
COL26A1 


N.A. 


N.A. 

Type XXVI collagen is expressed in the 
estis and ovary in adult tissues. Highest 
levels are found in the reproductive 
issues of neonates 

N.A. 


The NC1 domain contains a putative 
ER-targeting signal sequence, and an 
EMI domain, which is made up by 13 
cysteine residues. The NC1 domain 
also contains a putative furin protease 
cleavage site (RRRR) 

N.A. 

The structure comprises two collagenous 
domains (COL1 and COL2) consisting 
of Gly-X-Y repeats and three 
noncollagenous domains (NC1-3). 

The NC1 domain contains a putative 
ER-targeting signal sequence, an EMI 
domain and a furin protease cleavage 
site. It has been shown that type XXVI 
collagen forms trimers, accumulates in 
the extracellular matrix of mouse tissues 
and is secreted in vitro 

HSP47, a collagen-specific molecular 
chaperone that binds to the triple helical 
domain 

Type XXVI collagen has been suggested 
to be involved in the early development 
of testis and ovary as an extracellular 
matrix component 

N.A. 


N.A. 


References 
NCBI Gene ID: 


136227 


[2,3] 


[2] 


[2,3] 


[2] 


[2] 


COL, Collagen; NCBI, National Center for Biotechnology Information; ER, endoplasmic reticulum; 
EMI, emilin; N.A., not applicable. 
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SUMMARY 

Type XXVII collagen is a fibrillar collagen; however, the molecular structure is dif- 
ferent from that of other fibrillar collagens: type XX VII collagen forms characteristic 
nonstriated fibrils, its triple helix domain is shorter than that of other fibrillar collagens 
and it presents two interruptions, and it lacks the characteristic N-terminal telopep- 
tide-like region and the minor triple helical domain. This suggests that the function of 
this fibrillar collagen is different from that of other fibrillar collagens. In adults, type 
XXVII collagen is primarily expressed in cartilage. During development, the expres- 
sion is particularly high in developing bones, and mutations in mice lead to chondro- 
dysplasia. In humans the only disease associated with certainty with mutations in the 
type XXVII collagen gene is Steel syndrome, which is characterized by dislocated 
hips, prominent forehead and a long oval-shaped face, short stature, scoliosis, and other 
skeletal abnormalities. 


Type XXVII collagen is a fibrillar collagen encoded by the gene COL27A1, which 
was first identified by Pace et al. in 2003 [1]. The human gene is localized on 
chromosome 9q32, whereas the mouse homologous gene is located on chromo- 
some 4. Its sequence is particularly conserved in human, mouse, and fish. Most 
of the research done so far on type XX VII collagen has been performed in mouse 
or zebrafish [2—6]. 

As with other fibrillar collagens, type XX VII collagen contains an N-terminal 
and a C-terminal propeptide. The full-length procollagen is 1860 amino acids 
long, consisting of a 41-amino acid predicted signal peptide, the N-terminal 
propeptide (583 amino acids) that contains a thrombospondin (TSP) domain, 
the collagenous triple-helix domain (994 amino acids), and the C-terminal 
propeptide (242 amino acids) [1]. Together with type XXIV collagen, the 
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type XXVII collagen gene has been defined as belonging to a third class 
(type C) within the family of fibrillar collagens [7,8]. Its molecular structure 
is different from that of other fibrillar collagens because type XXVI 
collagen forms characteristic nonstriated fibrils whose width has been observed 
to vary from 10 to 80nm according to the preparation method used [6,9]. 
The type XX VII collagen triple-helix domain is shorter than those in the 
other fibrillar collagens, and it presents two interruptions. Moreover, it lacks 
the fibrillar collagen characteristic N-terminal telopeptide-like region and 
the minor triple helical domain [6,9]. 

As with types II and XI collagen, type XXVII collagen is primarily 
expressed in cartilage in adult organisms. Its expression is regulated by fac- 
tors SOX9 and Lc-Maf in chondrocytes [5,10]. During embryonic and fetal 
development, the expression of COL27A1 in mice is more broadly localized 
in the anlagen of endochondral bone [6]; in the developing lungs, ears, and 
colon [1]; in the retina and in the cornea of the eye; and in the major arteries 
of the heart [9]. This suggests that type XX VII collagen plays a key role in the 
developmental phases [9]. Its role in cartilage during fetal development has 
been studied. COL27A1 is expressed at 18-20 weeks of gestation in human 
fetal epiphyseal cartilage [11]. Type XX VII collagen is mainly deposited in 
skeletal tissues at sites of transition from cartilage to bone. In developing 
endochondral bone, type XX VII collagen plays a role in the transition of car- 
tilage to bone during skeletogenesis [6]. Its expression is particularly high in 
the matrix surrounding proliferative chondrocytes in the epiphyseal growth 
plate [4]. To better understand the role of type XX VII collagen, deletion and 
mutation knockin mice have been generated [4]. Mice carrying a heterozygous 
87-amino acid deletion in the collagenous domain survived into adulthood, 
but with growth impairments, whereas homozygous mice showed skeletal 
abnormalities, severe chondrodysplasia, and died at birth due to respiratory 
problems [4]. Mutant experiments in zebrafish have shown that knockdown in 
zebrafish embryos of COL27A1A alone and in combination with COL27A1B 
resulted in delayed and decreased vertebral mineralization, morphological 
abnormalities, and scoliosis [2]. 

In humans the only disease associated with certainty with mutations in 
the type XXVII collagen gene is Steel syndrome, characterized by dislo- 
cated hips, prominent forehead and a long oval-shaped face, short stature, 
scoliosis, and other skeletal abnormalities [12]. A mutation responsible 
for the syndrome has been identified in the homozygous missense variant 
Gly697Arg [12]. Moreover, it has been suggested that mutations influenc- 
ing the structure and properties of tenascin-C and type XXVII collagen 
could possibly lead to a risk for developing Achilles tendinopathy [13]. Pre- 
liminary evidence has been presented showing that single-nucleotide poly- 
morphisms in COL27A1 can be involved in the development of Tourette 
syndrome [14]. 


Type XXVII Collagen Chapter | 27 157 


Type XXVII Collagen Description References 
Gene name and number COL27A1, located on chromosome NCBI gene ID: 
9q32 85301 
Mutations with diseases in Steel syndrome; Tourette syndrome; [12-14] 
humans Achilles tendinopathy 
Mutations in mice Homozygous deletion of 87 [4] 
aa in the collagenous domain. 
Skeletal abnormalities, severe 
chondrodysplasia, and death at birth. 
Tissue distribution in healthy Adults: cartilage. Children and [1,6,9,11] 
states adolescents: epiphyseal growth 
plate. During embryonic and fetal 
development: endochondral bone, 
lungs, ear, colon, retina, cornea, and 
major arteries of the heart. 
Tissue distribution in NA 
pathological affected states 
Special domains TSP domain in the N-terminal [1] 
propeptide 
Special neoepitopes NA 
Protein structure and N-terminal propeptide (583 aa), 
function collagenous triple-helix domain (994 
aa), and C-terminal propeptide (242 aa). 
Binding proteins NA 
Known central function Type XXVII collagen may play a role [1] 
during endochondral bone formation, 
including calcification and degradation 
of cartilage 
Animal models with protein Knockdown of COL27A1A alone [2] 


affected 


Biomarkers of that protein 


and in combination with COL27A1B 
in zebrafish embryos: delayed and 
decreased vertebral mineralization, 
morphological abnormalities, and 
scoliosis. 

NA 


aa, amino acid; COL, collagen; NA, not applicable; NCBI, national center for biotechnology 
information; TSP, thrombospondin. 
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SUMMARY 

The human and murine type XVVIII collagen structurally resembles collagen type VI. 
Type XXVIII collagen is mainly located in peripheral nerves surrounding most non- 
myelinating glial cells and dorsal root ganglia, but it is also present in skin calvaria, at 
the nodes of Ranvier, and as a component of the peripheral nervous system nodal gap. 
Interestingly, type XXVIII collagen is overexpressed in bleomycin-induced lung and 
in mouse hepatocarcinoma, suggesting it is involved in damage repair processes. Type 
XXVIII collagen has been located in murine lungs associated with basement membranes, 
albeit the expression is not restricted to this membrane. The research on type XXVIII 
collagen is just emerging. Currently there are no mutations in humans or mice associated 
with biological functions, and no biomarkers have been put forward. 


The COL28A/ gene on human chromosome 7p21.3 and on mouse chromosome 
6A1 encodes collagen XXVIII with very restricted expression on tissues. Type 
XXVIII collagen is mainly located in peripheral nerves and surrounds all nonmy- 
elinating glial cells (with the exception of type II terminal Schwann cells in the 
hairy skin) and dorsal root ganglia. It is also present in skin and calvaria [1—4] and 
can be found at the nodes of Ranvier and is a component of the peripheral nervous 
system nodal gap [1]. 

Type XXVIII collagen mRNA is expressed in newborn sciatic nerves, but 
not in adults, although the protein persists into adulthood, suggesting a long 
half-life for the molecule [1,5]. Col28A1 was weakly expressed in the frontal 
cortex, heart, liver, adrenal gland, gallbladder, pancreas, esophagus, colon, rec- 
tum, urinary bladder, and prostate of adults [6]. 

Type XXVIII collagen has been also detected in the murine lung associated 
with basement membranes around vessels, airways, and alveoli [7]. Further- 
more, in zebrafish studies, type XXVIII collagen (Col28ala and Col28a2b) is 
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mainly expressed in the skeletal muscle and in the skin, making it likely to be 
implicated during the developmental stage. Lastly, there is a unique expression 
of Col28ala and Col28a2b in the liver and thymus that suggests the need for 
further analysis as the expression in mammals can be broader than acknowl- 
edged at present [1]. 

The human and murine type XVVIII collagen structurally resembles type VI 
collagen. It consists of two von Willebrand factor A (vWA) domains flanking 
a 528-amino acid collagenous domain. There are 16 uniformly arranged, very 
short imperfections (GIG and G4G) in the Gly-X-Y repeat, which is unique 
among the known collagens [3,8]. The triple helical part is longer than that in 
type VI collagen. The C terminus comprises a unique domain, nonhomologous 
to previously known collagen domains, and a Kunitz family serine protease 
inhibitor module similar to that in the vWA domain. Type XXVIII collagen 
forms dimers as shown by electrophoresis in composite polyacrylamide/agarose 
gels under nonreducing conditions [2,5]. 

Type XXVIII collagen is particularly associated with unmyelinated fibers, 
with the nodes of Ranvier of myelinated axons and with terminal Schwann cells 
in sensory end organs. Worth noting is the enhanced expression of type XXVIII 
collagen in a mouse model of Charcot—Marie—Tooth disease, a disease that is 
characterized by dysmyelinated nerve fibers, further confirming the correlation 
between absence of myelin and presence of type XXVIII collagen [1,3]. Type 
XXVIII collagen thus shows promise as a marker of nonmyelinated zones of the 
peripheral somatosensory system. 

Type XXVIII collagen was found in very low levels in healthy lung tissue, 
but it was overexpressed in bleomycin-induced lung injury, defined as a strong 
staining with a patchy appearance in fibrotic foci that did not colocalize with 
a-smooth muscle action—positive myofibroblasts. This indicates that a subset of 
cells specifically expressing type XXVIII collagen can be involved in the tis- 
sue repair process [7]. In addition, upregulation of Col28al was also observed 
in mouse hepatocarcinoma [6], which could point to a role of Col28al in the 
fibrotic stage of this disease. 


Type XXVIII collagen Description References 
Gene name and number COL28A1 
Mutations with diseases in N.A. 
humans 
Null mutation in mice N.A. 
Tissue distribution in Mainly located in peripheral nerves [1-4] 
healthy states surrounding most nonmyelinating glial 
cells and dorsal root ganglia. Also present 
in skin calvaria, at the nodes of Ranvier. 
Component of the PNS nodal gap 
Tissue distribution in Overexpressed in bleomycin-induced lung [6,7] 


pathological affected states 
Special domains 
Special neoepitopes 


and in mouse hepatocarcinoma 
None 
None known 
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Type XXVIII collagen Description References 
Protein structure and Homotrimeric molecule with its a chain [3] 
function composed of a 528-amino acid collagenous 


domain flanked by two vWA modules, 
which are known to be involved in protein- 
protein interactions. The C terminus consists 
of a module without homology to any 
known domain, followed by a Kunitz family 
serine protease inhibitor module 


Binding proteins Forms dimers {1] 
Known central function Structural and functional role in the PNS [3] 
Animals models with N.A. 


protein affected 
Biomarkers of that protein N.A. 


COL, Collagen; N.A., not applicable; PNS, peripheral nervous system; vWA, von Willebrand factor A. 
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SUMMARY 


Laminins are a major constituent of the basement membrane which is an intricate mesh- 
work of proteins separating the epithelium, mesothelium, and endothelium from connec- 
tive tissue. There are 15 different laminins, each consisting of a unique combination of 
three subchains. The combination of chains confers some tissue specificity. Laminins are 
essential for the function of the basement membrane as most null mutations are lethal. 
Just as collagens, laminins are structural proteins with a helical region formed by heptads. 
The heptad has a less strict organization compared to collagens, where the helical domain 
consists of a strict building block of three amino acids. Laminins carry out a central role 
in organizing the complex interactions of the basement membranes. This is seen through 
the wide range of interaction partners such as syndecan, nidogen, collagen, integrins, dys- 
troglycan, and heparin. Mutations in different laminin chains are associated with diseases 
such as Alport syndrome, epidermolysis bullosa, and muscle dystrophies. The laminin 
family is large, and biomarkers for measuring laminins are emerging. The essential role of 
laminins in the basement membrane can be summarized by the following: Even the most 
exceptional construction will not last on a poor foundation. 
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Laminins are a major constituent of the basement membrane, a thin layer of 
specialized extracellular matrix (ECM) that surrounds epithelia, endothelia, 
muscle cells, fat cells, Schwann cells, peripheral nerves, and the entire cen- 
tral nervous system. The basement membrane is an intricate meshwork com- 
posed of laminins, collagen IV, nidogens, and sulfated proteoglycans [1,2]. 
Even though the basement membrane consists of the same proteins, different 
isoforms of these combine to form a structurally and functionally diverse base- 
ment membrane. 

Laminins are multidomain proteins and are integral parts in the matrix of the 
basement membrane. During early development of mice embryos, the presence 
of laminin is sufficient for the formation of basement membrane-like structures 
in the absence of the other major basement membrane protein, type IV collagen 
[3]. Structurally, laminins can generally be viewed as intertwined, heterotrimeric 
glycoproteins of one of three types: cross shaped, Y shaped, or rod shaped [4]. Each 
laminin is made up of an a, §, and y chain (Table 29.1). The formation of laminin 
trimers seems to depend on the C-terminal portions of their coiled regions [5]. The 
difference in structure of the heterotrimeric laminin glycoproteins is mainly through 
differences in the a chains that give rise to the three structural versions mentioned 
above. One unique feature of all laminin a chains is that they contain a large C-ter- 
minal globular (G) domain with five modules known as laminin G (LG) domains 
[6-8]. Currently, five distinct a, three 6, and three y laminin chains have been iden- 
tified, and together they form at least 15 isoforms of which 15 are presented in the 
following sections (see Table 29.1) [9,10]. Contradicting evidence for the existence 
of the laminin 64 chain has been presented, and it remains controversial whether it 
has been found in vivo, so this chain is not discussed in this chapter. 

During development, different laminins are present in various tissues. As 
part of the maturation of organs, these early laminins are often replaced by 
other laminins. This phenomenon ultimately produces tissue-specific laminins 
that often depend on the type of laminin a chain. One laminin that defies this 
general pattern is laminin-8 (a4ß1y1) that is expressed by all endothelial cells 
regardless of their stage of development [11]. 

A wide range of proteins interact with laminins. Among these proteins are cell 
surface receptors. The cell surface receptors that interact with laminins can be 
separated into integrins and nonintegrins. For example, integrins that have been 
shown to interact with laminins-10 and -11 include 361, a6ß1, and a6ß4 [12]. 
Currently, three different classes of nonintegrin cell surface receptors are known. 
Two of these classes, dystroglycan and syndecan, are proteoglycans, and among 
the latter class, Lutheran blood group glycoprotein (also known as basal cell adhe- 
sion molecule), is a member of the immunoglobulin superfamily. The extracellular 
a chain of dystroglycan primarily binds to the G domains of laminin a1 and a2, 
and with less affinity for other G domains of other a chains [13—15]. Interactions 
of the specific laminin chains can be seen in the tables in the following sections. 

In an attempt to increase the clarity of the chain composition of laminins, 
Aumailley et al. [16] came up with a new nomenclature that is being increasingly 
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TABLE 29.1 Laminin Chain Composition, Simplified Nomenclature, and 
Sites of Expression 


Alternative Major Site(s) of 
Protein Chains Nomenclature Expression 
Laminin-1 alplyl Laminin-111 Developing epithelia 
Laminin-2 a2ß1y1 Laminin-211 All myogenic tissues, 


peripheral nerves, 
mesangial matrix of 


glomerulus 

Laminin-3 alp2y1 Laminin-121 Myotendinous junctions 

Laminin-4 a2p2y1 Laminin-221 Neuromuscular junction, 
mesangial matrix of 
glomerulus 

Laminin-5 a3ß3y2 Laminin-332 Epidermis 

Laminin-6 a3ß1y1 Laminin-311 Epidermis 

Laminin-7 a3ß2y1 Laminin-321 Epidermis 

Laminin-8 a4ß1y1 Laminin-411 Endothelium, adipose 


tissue, peripheral nerves 


Laminin-9 a4ß2y1 Laminin-421 Endothelium, smooth 
muscle, neuromuscular 
junction, mesangial matrix 
of glomerulus 


Laminin-10 a5ß1y1 Laminin-511 Mature epithelium, mature 
endothelium, smooth 
muscle, peripheral nerves 


Laminin-11 a5ß2y1 Laminin-521 Mature epithelium, mature 
endothelium, smooth 
muscle, neuromuscular 
junction, glomerular 
basement membrane 


Laminin-12 a2ß1y3 Laminin-213 Surface of ciliated epithelia 

Laminin-13 a3ß2y3 Laminin-323 Central nervous system, 
retina 

Laminin-14 a4ß2y3 Laminin-423 Central nervous system, 
retina 

Laminin-15 a5p2y3 Laminin-523 Central nervous system, 


retina 
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used in the laminin field. In this new nomenclature, laminin-1 would be called 
laminin-111, which is more explanatory than the original name because it 
reveals the composition of chains (see Table 29.1). 

In the following sections, select findings are presented on each of the 
laminins. 


LAMININ-1 


Laminins-1 and -10 seem to be the only two laminins expressed in early postim- 
plantation embryos. Laminin-| is the core laminin in the Reichert membrane, an 
extraembryonic basement membrane [17]. Because the laminin a1 chain inter- 
acts with dystroglycan and the absence of either leads to early lethality during 
development, both seem to be essential for structural integrity of the Reichert 
membrane. The importance of laminin composition in basement membranes 
was evaluated by Miner et al. [17], who showed that the laminin a1, 61, and y1 
chains were necessary for development. In mice lacking either of these chains, 
the embryos died early and no gastrulation occurred [17]. The phenotype of 
the mice could partially be rescued by overexpression of the laminin a5 chain 
through a transgene, but the Reichert membrane and blood sinuses were not 
formed [17]. An in vitro study showed that laminin-1 was able to cause dif- 
ferentiation of pluripotent stem cells into ectoderm [18]. However, even though 
laminin-1 is present in all epithelia during development, it is mostly replaced by 
other laminins as the epithelium matures. One example of this replacement is 
seen during glomerulogenesis. During development of nephrons, epithelial cells 
adjacent to the capillaries of the glomerulus differentiate into podocytes that 
help establish the glomerular filtration barrier [19]. The podocytes lie on a base- 
ment membrane, and the endothelial cells produce a separate basement mem- 
brane. During maturation of the glomerulus, these two basement membranes 
fuse to form the glomerular basement membrane (GBM) [20]. Throughout the 
early stages of GBM maturation, laminin-1 is present, but it is later replaced by 
laminins-10 and -11. In the final stages of maturation, laminin-10 disappears, 
leaving laminin-11 as the sole laminin in the GBM [21]. 

One of the cellular effects of laminin-1 is seen in experiments with cells of 
the dorsal root ganglia. In the presence of laminin-1 in the matrix, interactions 
with «361 and a7ß1 integrins on the cells lead to neurite outgrowths [22]. It 
thus seems that the signaling induced by laminin-1 is mediated through interac- 
tion with other proteins such as cell surface receptors that relay information and 
ultimately alter gene transcription. 


LAMININ-2 


The major laminin in skeletal muscle is laminin-2 [23]. The integrity of 
muscle cells is ensured by the dystrophin—glycoprotein complex that medi- 
ates crucial linkages [24]. It is believed that a-dystroglycan binds the G 
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domain of the laminin a2 chain, thus stabilizing the basement membrane and 
enabling laminin polymerization [13—15,25]. Another important interaction 
is seen at myotendinous junctions where the laminin a2 chain interacts with 
a7ß1 integrin. Absence of the integrin a7 subunit leads to a form of muscular 
dystrophy [26,27]. 

In the neuromuscular system of mammals, seven laminin chains are 
expressed (al, a2, a4, a5, B1, B2, and y1) that produce laminins-1, -2, -4, 
-8, -9, -10, and -11. In-depth discussion of the distribution of these laminins 
in the various basal laminae is beyond the scope of this chapter, but it has 
been reviewed previously [23,28]. In Schwann cells of peripheral nerves, lam- 
inin-2 is present in the basal lamina [29]. The interaction of laminin-2 with 
a-dystroglycan is also seen in axons of the peripheral nervous system where 
it is important for degeneration and regeneration of axons. The expression 
of the laminin a2 chain correlates with levels of the cytoplasmic domain of 
6-dystroglycan [30]. With increasing axonal deterioration, immunoreactivity 
of both the laminin a2 chain and B-dystroglycan decreased, but progressively 
increased during regeneration [30]. Based on these findings, it was speculated 
that the interaction between the dystroglycan complex and laminin-2 enables 
myelin formation. 

Laminin-2 has also been found to be part of the mesangial matrix of the 
kidney in mice and humans, but not in rats [21,31—33]. 


LAMININ-3 


The myotendinous junctions are specialized sites for the transmission of force 
between the muscle and tendon. During formation of myotendinous junctions, 
a special basement membrane is formed where laminins al and a5 chains (ie, 
LAMAI and LAMAS) are specifically localized. Later during development, 
the laminin 62 chain (LAMB2) is expressed, and LAMAS becomes distributed 
along the entire myotube. Based on the codistribution of the laminin chains, it 
was speculated that a switch from laminin-1 to laminin-3 occurs during devel- 
opment of the myotendinous junction [34]. 


LAMININ-4 


Although laminin-2 is the major laminin present in skeletal muscle, laminin-4 is 
also present and resides in the neuromuscular junction [23]. It has been shown 
that mice lacking LAMB2 have severe functional defects in the neuromus- 
cular junction and die within a few weeks after birth [35]. Through studies it 
has become clear that this effect is mediated by an apparent repellent effect of 
LAMB2 on the Schwann cells [36]. In the presence of LAMB2, the Schwann 
cell acted normally, but in its absence, the Schwann cell wrapped the nerve ter- 
minal in membrane extensions that obstructed the synaptic cleft and blocked the 
passage of the neurotransmitter from nerve to muscle. 
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Little evidence exists for the presence of laminin-4 in the mesangial 
matrix of the kidney. However, in comparison to humans and mice, which 
have laminin-2 in their mesangial matrix, this matrix contains laminin-4 in 
rats [21,31-33]. 


LAMININS-5, -6, AND -7 


In general, epithelial tissues line all surfaces facing the outer environment and 
include the skin, oral cavity, lung, hair follicles, many glands, and the gastroin- 
testinal and urinary tracts. All epithelial cells are polarized and their basolateral 
surface rests on a basement membrane. Epithelial basement membranes contain 
a high level of laminin-5 that interacts with a range of different proteins, such as 
the «664 integrin, and the hemidesmosome through interaction with BP-180 and 
intracellular plectin, both of which are part of the hemidesmosome [28]. Attach- 
ment of the basement membrane to the underlying stroma is partially achieved 
through the interaction of laminin-5 with type VII collagen, which forms part of 
the anchoring fibrils that extend into anchoring plaques in the stroma [37-39]. 
The epithelial basement membrane zone consists of a range of structures main- 
taining the stable association of the epidermis and the dermis. Among others, 
these structures include the aforementioned hemidesmosomes and anchoring 
fibrils. Combined, these structures form a highly interconnected complex, thereby 
securing correct organization of the layers of the skin. If one, or more, of the pro- 
teins in this meshwork are altered, eg, due to genetic alteration, the skin becomes 
fragile, often leading to one of the types of epidermolysis bullosa [40]. 

The epithelial basement membrane also contains laminin-6 and, to a lesser 
extent, laminin-7. Through studies of the epithelial basement membrane of the 
amnion, it seems that laminins-6 and -7 form a covalent complex with laminin-5 
[41]. Because the basement membrane of the amnion resembles that of the skin, 
this interaction is likely to be present in the skin. During mechanical stimuli, 
laminin-6 has been shown to be involved in signal transduction through com- 
plexes formed with perlecan and dystroglycan [42]. 


LAMININ-8 


Laminin-8 (a461y1) is expressed by all endothelial cells and is present in the 
basement membrane of the vasculature, along with other laminins such as -9, 
-10, and -11. The crucial role of the laminins containing the laminin a4 chain 
(LAMA4) in the basement membrane of the endothelium, and their role in 
microvessel formation, was shown through deletion of the chain in mice [43]. 
Both during the embryonic and neonatal period, Lama4 null mice presented with 
diffuse bleeding in subcutaneous fat and also intramuscularly [43]. Mutation of 
Lama4 led to delayed deposition of nidogen and collagen IV into the basement 
membrane of the capillaries, thus causing discontinuities. The mutation, how- 
ever, only led to a minor increase in neonatal mortality [43]. Laminin-8 has also 


Laminins Chapter | 29 169 


been implicated in lymphocyte physiology; it has been shown to be synthesized 
by lymphoid cells. It also promotes lymphocyte migration and is able to stimu- 
late T-cell proliferation in concert with CD3/T cell receptor engagement [44]. 

Differentiation of adipocytes from a preadipocyte cell line led to an increased 
expression of mRNA corresponding to laminin-8 and was thus judged to be part 
of the ECM of adipose tissue [45]. To determine the effect of LAMA4 abla- 
tion on weight gain and adipose tissue function, a study was conducted in mice 
deficient in Lama4 (Lama4~'-) and compared with wild type. In the absence 
of LAMA4, mice had decreased epididymal adipose tissue mass and defective 
lipogenesis, but no differences were observed in subcutaneous adipose tissue 
[46]; thus, it was suggested that LAMA4 affects adipose structure and function 
in a deposit-specific manner. 

In the peripheral nervous system, axons are covered with laminins containing 
the laminin 61 chain (LAMB1) [23]. This covering is provided by laminins-2, 
-8, and -10. Through knockout studies, it became clear that LAMA4-containing 
laminins (eg, laminins-8 and -9) are important for Schwann cell maturation. 
Whereas heterozygous animals had partial demyelination, animals deficient in 
LAMA4 had an almost complete amyelination and severe neuropathy [47,48]. 


LAMININ-9 


Laminin-9 is present in the neuromuscular junction [23]. As with laminin-4, 
the absence of LAMB2 leads to severe muscular defects due to the hindrance 
of neurotransmitter deliverance to muscle [35,36]. Furthermore, just as absence 
of LAMA4 leads to improper localization of synapses; it seems that LAMB2 is 
also involved in guiding proper localization of synapses [49]. 

A study of mesangial cells suggested that the presence of laminin-9 is nec- 
essary for migration induced by insulin-like growth factor binding protein-5 
[50]. The migration was inhibited by antibodies that blocked the attachment to 
a6ßl integrins. Antisense inhibition of laminin-9 expression led to absence of 
the integrin on the cell surface [50]. These findings suggest that the presence of 
laminin-9 in the mesangium is necessary for development. 

As mentioned for laminin-8, laminins containing LAMA4 (ie, laminins-8 
and -9) are required for correct microvessel formation [43]. In the absence of 
LAMA4, mice developed diffuse and extensive bleeding in subcutaneous fat 
and also intramuscularly [43]. The deposition of nidogen and type VI collagen 
into the basement membranes was delayed, which suggests a vital role for the 
LAMA4 chain in correct basement membrane formation in the endothelium. 


LAMININ-10 


In a study performed by Pouliot et al. [51], the LAMAS chain of laminins-10 
and -11 was shown to be abundantly expressed in the basement membrane 
underlying the interfollicular epidermis and in the blood vessels in the dermis 
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of humans [51]. Although laminin-5 expression did not change with age in the 
basement membrane underlying the epidermis, quantities of laminins-10 and 
-11 both seemed to decrease in adult skin [51]. It was also proven that these 
laminins were potent adhesive substrates for keratinocytes, with the adhesion 
being mediated by interactions with integrins 0361 and 64, and that the lami- 
nins stimulated keratinocyte proliferation and migration [51]. 

During hair germ elongation, it has been shown that there is a reduced 
expression of laminins-1 and -5 [52,53]. Because laminin-10 is present in 
the basement membrane of elongating hair germs, whereas other laminins 
are downregulated, Li et al. [54] proposed that laminin-10 was necessary for 
hair follicle development. To prove this hypothesis, they generated LAMA5- 
deficient mice and showed that their skin contained fewer hair germs than 
controls and that the skin presented with lower levels of early hair markers 
such as sonic hedgehog and Glil. After transferring skin grafts to nude mice, 
the lack of hair development could be reversed by the addition of purified 
laminin-10, thus underlining the vital role of this laminin in hair development 
[54]. In summary, it seems that laminin-10 is necessary for hair follicle epi- 
thelial invagination (ie, papillae formation), but is not required for formation 
of papilla cells, indicating that this laminin is part of a scaffold necessary for 
the generation of papillae [55]. 

The pleura of the lungs are made up of mesothelial cells and an underlying 
basement membrane containing laminin-10. Null mutation of LamaS leads to 
breakdown of the basement membrane with a resulting inability to separate the 
four lobes of the right lung [56]. Laminin-10 is also present in various areas 
of vasculature. In embryos lacking LAMAS, only minor vascular defects were 
observed. However, the absence of LAMAS in mice led to other severe defects 
such as exencephaly, syndactyly, and placentopathy of the embryos [57]. In 
contrast to the Lama5 mutation, absence of either LAMB1 or the laminin y1 
chain (LAMC1) causes early death of the embryo because no basement mem- 
brane is able to form [17,58]. 

As mentioned for laminin-1, laminin-10 is present during development of 
the GBM [21]. However, during maturation of the GBM it is gradually replaced 
by laminin-11. Laminin-10 is still found extensively in the mature kidney, but 
it is localized to the basement membranes of the tubules and collecting ducts 
[2131]; 

A potential effect of laminin-10 during development of neurons was seen in 
an in vitro study of cells from dorsal root ganglia of mice [22]. It was shown 
that laminin-10 bound integrin a6ß1 and that this lead to outgrowth of neurites, 
independently of the addition of the trophic factor neurotrophin [22]. 


LAMININ-11 


Laminin-1!1 is present in the neuromuscular junctions [23]. In the synaptic basal 
laminae, LAMA5 and LAMB2 are present, making laminin-11 the prominent 
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laminin of synapses [29,32]. In the absence of LAMB2, severe muscular defects 
are observed as the delivery of the neurotransmitter is obstructed by membrane 
extensions of Schwann cells [35,36]. The presence of laminin-11, along with 
other laminins containing LAMB2, at the neuromuscular junctions thus seems 
to ensure the integrity of muscle innervations. 

As stated for laminin-1, during the early phases of glomerulogenesis lam- 
inin-1 is present, but it is gradually replaced by laminins-10 and -11. During 
the final maturation of the glomerulus, laminin-10 levels decrease, leaving lam- 
inin-11 as the sole laminin of the GBM [21]. Immunoelectron microscopy of 
developing mouse kidneys showed that laminins-1 and -11 were produced by 
both endothelial cells and podocytes, suggesting that these cell types are impor- 
tant for the development of the GBM [59]. 


LAMININ-12 


A novel laminin was isolated via a precipitation experiment of human placental 
chorionic villi; it consisted of laminin chains a2, 61, and an unknown y chain 
[60]. On the basis of sequence analysis results, the laminin a chain was identi- 
fied as LAMA2, despite the fact that a monoclonal antibody targeting LAMA2 
was unable to recognize it. The laminin ß chain was identified using a monoclo- 
nal antibody specific to LAMB1 [60]. The identification of the laminin y proved 
to be more difficult because it contained sequences that were not found in any 
of the known laminin chains. Thus, Koch et al. concluded that they had found a 
novel laminin y chain that they named laminin y3 (LAMC3) [60]. The unique 
combination of chains generated laminin-12. 


LAMININS-13, -14, AND -15 


The expression profiles of laminins in different areas of the nervous system 
have been investigated in multiple studies. In the CNS, laminins-13 and -14, 
but not -15, are present at synaptosomes in, eg, the hippocampus [61]. In the 
inner retina, laminins have been seen in association with retinal ganglion 
cells [62-65], and LAMB2 was seen in association with photoreceptors 
[66,67]. In vitro studies revealed that LAMB2 promotes the phenotype of rod 
photoreceptors [66,68] and that it is vital for the correct development of pho- 
toreceptors [69]. Libby et al. [70] replicated the previous histological find- 
ings for the LAMB2 distribution and a similar distribution was also found 
in the human retina [70]. Isolation of laminin complexes from the retina 
revealed the presence of two novel laminin heterotrimers termed laminins-14 
(a4B2y3) and -15 (a5B2y3) [70]. Based on the findings, no conclusive data 
were presented, but it has been speculated that LAMB2 may be localized to 
synapses in the CNS as it is in the peripheral nervous system [71]. 

In the remainder of this chapter, tables representing findings for the indi- 
vidual chains are presented. 
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LAMA1 


Laminin Subunit 
al 


Gene name and 
number 


Mutations with 


diseases in humans 


Null mutation in 
mice 


Tissue distribution in 
healthy states 


Tissue distribution 
in pathologically 
affected states 


Special domains 


Description 

LAMA1 

UniProt: P25391 

Chromosome: 18p11.31 

Mutations in this gene may be associated with 
Poretti-Boltshauser syndrome. Mutations in 
LAMA1 cause cerebellar dysplasia and cysts with 
or without concomitant retinal dystrophy. 

Null mutations are not viable because they 
cause death at E6.5-E7 due to the absence of 
the Reichert membrane. However, conditional 
Lama? knockout mice in the epiblast lineage 
have been produced in Sox2-Cre mice. This led to 
hindered development of the cerebellum, which 
might partially be explained by the resulting 
conformational defects of the pial membrane. 

A marked reduction in numbers of dendritic 
processes in Purkinje cells was also observed in 
these mice. Adult Sox2-Cre-induced conditional 
Lama? knockout mice developed focal 
glomerulosclerosis and proteinuria with age. 
Another model revealed that Lama? knockout 

in mice led to a reduced integrity of the inner 
limiting membrane of the vitreous, because the 
vitreal astrocytes migrated into the vitreous. 
Present in developing epithelia, myotendinous 
junctions, and developing lung. It is also 
important for development of the cerebellum 

of mice, but does not generally seem to be 
associated with this tissue. 

Missense mutation in Lama? of mice (Y265C) led 
to a disruption of retinal vascular development 
and a thinning of the inner limiting membrane. 
Enhanced and differential distribution of LAMA1 
and LAMC1 distinctly associate these two 
laminins with Alzheimer disease. 

The laminin a chains are different from all 

other laminin chains because they contain a G 
domain. The G domain consists of five self-folding 
modules called laminin G modules (LG1-5). 

All a chains also contain a long coiled-coil 
domain (I/Il) that mediates heterotrimer assembly 
with similar domains in the B and y chains. The 
laminin «1, a2, a3B, and a5 chains each contain 
a short arm that consists of alternating globular 
domains (IVa, IVb, and VI/laminin N-terminal) 
and rod-like domains composed of laminin EGF- 
like repeats (Illa, IIb, and V) (see Fig. 29.1). 
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Laminin Subunit 


al Description References 
Special neoepitopes None found in literature. 

Protein structure See special domains. 

and function 

Binding proteins Dystroglycan (a chain), heparin, and sulfatides. [13,15;77,78] 


LAMA1 forms heterotrimers with laminin B and y 
chains (see Table 29.1). 


Known central Present during glomerulogenesis, but is not partof [79] 
function the mature kidney. 

Animal models with See “Null mutation in mice.” [73-75] 
protein affected A mouse model expressing an altered LAMA1 [80] 


version, lacking the globular domains 4 and 
5, developed as a fetus, but died due to failed 
epiblast polarization. 
In mice with collagen a3 (IV) chain knockout, [81] 
resulting in Alport syndrome, an upregulation 
of LAMA1 was seen in the GBM. The increased 
levels of LAMA1 and LAMA5 in the GBM 
contributed to an abnormal permeability. 
Biomarkers None found in the literature. [81] 
However, if the upregulation of LAMA1 in the 
GBM of the mouse model for Alport syndrome 
holds true for humans, LAMA1 could be a 
potential biomarker of Alport syndrome. 


p Laminin N-terminal a Laminin IV type A O EGF-like 


D Laminin IV type B O Laminin G-like 


FIGURE 29.1 Schematic representation of domains in the laminin chains. Different isoforms 
of the chains exist. For example, in the representation LAMA3 is presented as the LAMA3B ver- 
sion. In contrast, the LAMA3A lacks the N-terminal 1620 residues and has an altered sequence 
from residues 1621 to 1665. 
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LAMA2 


Laminin 
Subunit a2 


Gene name and 
number 


Mutations with 
diseases in 
humans 


Null mutation in 
mice 


Tissue 
distribution in 
healthy states 


Tissue 

distribution in 
pathologically 
affected states 


Special domains 


Special 
neoepitopes 


Description 

LAMA2 

UniProt: P24043 

Chromosome: 6q22-23 

Associated with merosin-deficient congenital 
muscular dystrophy and limb-girdle muscular 
dystrophy. 

Dilated cardiomyopathy with conduction defects 
have been noted in a patient with partial merosin 
(laminin-2) deficiency. In an observational cohort 
study, LAMA2 mutations were associated with 
cardiac problems. 

Null mutation in mice leads to growth retardation 
and severe muscular dystrophic symptoms and mice 
die by 5 weeks of age. Null mutations in Lama2 
expressed during lung development have not been 
reported to cause lung defects. The models for total 
Lama2 knockout all seem to result in death within 
the first 15 weeks after birth. All models targeting 
Lama2 have varying degrees of muscular dystrophy 
and neuropathy. There also seems to be defective 
spermatogenesis and odontoblast differentiation. 
LAMA2 is found in laminins-2, -4, and -12. 
Laminin-2 is a constituent of all myogenic tissues 
and peripheral nerves. Laminin-4 is part of the 
neuromuscular junction and mesangial matrix of 
glomerulus. Laminin-12 is found in ciliated epithelia, 
but the existence of this laminin is controversial. 
Based on levels of mRNA in the myocardium, there 
seems to be a negative correlation between levels of 
LAMA4 and age. 

There seems to be an imbalance between myocyte 
hypertrophy and the level of LAMA2. The imbalance 
might contribute to alterations in sarcolemmal 
properties that occur during the development of 
cardiac hypertrophy and its transition to cardiac 
failure. 

The laminin a chains are different from all other 
laminin chains because they contain a G domain. 
The G domain consists of five self-folding modules 
called laminin G modules (LG1-5). All a chains also 
contain a long coiled-coil domain that mediates 
heterotrimer assembly. Similar domains are found 

in the B and y chains. The laminin a1, a2, 3B, and 
a5 chains each contain a short arm that consists of 
alternating globular domains and rod-like domains 
composed of laminin EGF-like repeats (see Fig. 29.1). 
None found in the literature. 
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LAMA3 


Laminin 
Subunit a3 


Gene name and 
number 


Mutations with 
diseases in 
humans 


Null mutation in 
mice 


Tissue 
distribution in 
healthy states 


Tissue 

distribution in 
pathologically 
affected states 
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Description 

Not determined. 

The domains of the chain are shown in Fig. 29.1. 
Integrin «761, dystroglycan (a chain), and heparin. 
LAMA2 forms heterotrimers with laminin B and y 
chains (see Table 29.1). 

LAMA2 binds to the dystrophin-glycoprotein 
complex that is the mediator of the linkage between 
the sarcolemma to both the basement membrane and 
to the underlying cytoskeleton 

Hypomorphic mutation of Lama2 in mice has produced 
a protein that is missing a part of the N-terminal domain 
VI. This domain is necessary for trimer formation and 
the laminin is thus not able to polymerize. 

None found in the literature. 


Description 

LAMA3 

UniProt: Q16787 

Chromosome: 18q11.2 

Mutations in the protein have been shown to cause 
epidermolysis bullosa. 

Reverse transcription-polymerase chain reaction 
analysis of mRNA extracted from the proband’s 
keratinocytes identified a homozygous single 

base pair deletion in the transcripts encoding the 
laminins «3A and 3B isoforms. 

Nonsense mutations within LAMA3 have been 
shown to be associated with the pathogenesis of 
junctional epidermolysis bullosa. 

Targeted disruption of Lama3 in mice reveals 
abnormalities in survival and late-stage differentiation 
of epithelial cells. Severe skin blistering and 
abnormal glomerulogenesis were noted. 

LAMA3 is found in laminins-5, -6, and -7 of the 
epidermis. 

LAMB3 is also part of laminin-13 found in the CNS 
at synapses in the hippocampus. 

In the lungs, LAMA3 is expressed by alveolar 
epithelial cells. 

Disruption of LAMA3 leads to disorganized 
hemidesmosomes. The malfunction in crosslinking 
at the basement membrane between the epidermis 
and the dermis leads to fragility and blister 
formation of the skin. 

Production of laminin-5 (a3ß3y2) is uncoordinated 
in the airway epithelium of allergic asthmatics. 
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Laminin 
Subunit a3 
Special domains 


Special 
neoepitopes 
Protein structure 
and function 


Binding proteins 


Known central 
function 
Animal models 
with protein 
affected 
Biomarkers 


LAMA4 


Description 

The laminin a chains are different from all other 
laminin chains because they contain a G domain. The 
G domain consists of five self-folding modules called 
laminin G modules (LG1-5). All a chains also contain 
a long coiled-coil domain that mediates heterotrimer 
assembly. Similar domains are found in the B and 

y chains. The laminin a1, a2, a3B, and a5 chains 
each contain a short arm that consists of alternating 
globular domains and rod-like domains composed of 
laminin EGF-like repeats (see Fig. 29.1). 

Two well-described LAMA3 chains are presented in 
the literature: the A-chain and B-chain. The A-chain 
differs from the B-chain by the absence of the 
N-terminal 1620 residues and an altered sequence 
from residue 1621 to 1665. 

None found in the literature. 


The laminin a3 chain resembles a truncated version 
of the laminin a1 and a2 chains. To see a structural 
representation of the differences between these and 
other chains, see Fig. 29.1. 

a1ß3 integrin, plasminogen, and tissue plasminogen 
activator exhibit specific high affinity binding to the 
LG1 subdomain of the a3 chain of laminin. Other 
binding proteins are syndecan-2 and -4 and heparin. 
LAMA3 forms heterotrimers with laminin B and y 
chains (see Table 29.1). 

Part of the complex that forms hemidesmosomes of 
the epidermis and thus keeps this structure stable. 
Targeted disruption has been shown to lead to 
profound epithelial abnormalities, causing neonatal 
death of the affected mice. 

No biomarker as such has been developed, but 
LAMA3 could be of potential interest as high 
LAMA3 expression is associated with gastric cancer. 


Laminin Subunit a4 Description 


Gene name and 
number 


Mutations with 


diseases in humans 


LAMA4 

UniProt: Q16363 

Chromosome: 6q21 

Mutations of LAMA4 have been shown to give 
rise to cardiomyopathy, a disorder characterized 
by ventricular dilation and impaired systolic 
function. The mutations ultimately result in 
congestive heart failure and arrhythmia with 

a concomitant increase in risk of premature 
death. The disease is caused by defects in 
cardiomyocytes and endothelial cells. 
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References 
[43,47-49] 


Description 

During the embryonic and neonatal period, 
Lama4 knockout mice presented with 
hemorrhaging. Extensive bleeding, decreased 
microvessel growth and mild locomotion 
defects were also seen in experimental settings 
to monitor angiogenesis. Mice deficient in 
LAMA4 had an almost complete amyelination 
of nerves and presented with severe 
neuropathy. 

In healthy tissue, the LAMA4 chain is present 
in endothelium, smooth muscle (laminins-8 
and -9), adipose tissue, peripheral nerves 
(laminin-8), neuromuscular junctions 
(laminin-9), mesangial matrix of the glomerulus 
(laminin-9), CNS and retina (laminin-14) (see 
Table 29.1). It is also found in the lungs where 
the laminin a4 chain is expressed by lung 
fibroblasts. 

LAMA4 is highly expressed in human 
hepatocellular carcinoma in Chinese patients 
and is thought to be a novel marker of tumor 
invasion and metastasis. 

LAMA4 is also widely expressed in renal cell 
carcinomas and has a deadhesive effect. 
Downregulation of LAMA4 expression inhibits 
glioma invasion in vitro and in vivo. 

The epithelial-to-mesenchymal transition has 
been shown to downregulate LAMA5 and 
upregulate LAMA4 in oral squamous carcinoma 
cells, which could promote cancer cell 
invasion. 

The laminin a chains are different from all [7] 
other laminin chains as they contain a G 
domain. The G domain consists of five self- 
folding modules called laminin G modules 
(LG1-5). All a chains also contain a long 
coiled-coil domain that mediates heterotrimer 
assembly. Similar domains are found in the B 
and y chains. 

To see a representation of the domains found in 
LAMA4, see Fig. 29.1. 

The high expression of LAMA4 in different 
cancers could potentially be used to find 
neoepitopes that are more abundantly 

present in these pathological settings (see 
“Biomarkers”). 

The overall size of the LAMA4 chain is much 
smaller than that of other laminin a chains. It 
has four EGF-like domains, but the fourth is 
truncated. It still contains five LG domains (see 
Fig. 29.1). 

LAMA4 interacts with aVB3 integrin, heparin, 
syndecans-2 and -4. 
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Laminin Subunit a4 
Known central 
function 


Animal models with 
protein affected 


Biomarkers 


LAMA5 


Laminin Subunit a5 
Gene name and 
number 


Mutations with 
diseases in humans 
Null mutation in 
mice 


Description 

LAMA4 is thought to be implicated in the 
development of embryos, but the exact function 
is not known. The involvement of LAMA4 in 
development could be through its ability to 
promote microvessel formation. 

LAMA4 has been shown to bind to syndecans-2 
and -4 and promote cell adhesion. 

LAMA4 binds heparin through the LG4 domain. 
LAMA4 is also thought to stimulate 
angiogenesis by its interaction with the aVB3 
integrin, which is mediated by residues 
1121-1139 of the LG2 domain. 

Deficiency of Lama4 showed that this chain is 
involved in lipogenesis of epididymal adipose 
tissue, but not in lipogenesis of subcutaneous 
adipose tissue. 

Null mutation in mice resulted in impaired 
microvessel growth. 

Nude mice with an intracerebral injection 

of glioma cells transfected with an antisense 
oligonucleotide for LAMA4 resulted in the 
formation of a noninvasive tumor, whereas 
injection of glioma cells transfected with sense 
oligonucleotide for LAMA4 resulted in diffuse 
invasion of brain tissue. It thus seems that 
expression of LAMA4 promotes invasion of 
glioma cells both in vitro and in vivo. 

LAMA4 is a potential biomarker for invasion 
and metastasis in hepatocellular carcinoma. 

As LAMA4 has also been seen in association 
with cells from oral squamous carcinoma and 
renal cell carcinoma, it might be interesting to 
assess this chain for its potential as a diagnostic 
tool for various types of cancer. 


Description 

LAMA5 

UniProt: 015230 

Chromosome: 20q13.2-13.3 

No mutations have been described in the 
literature. 

Elimination of Lama5 results in multiple 
defects, eg, in glomerular filtration and late 
gestational lethality due to placental defects. 
The LAMAS deficiency also demonstrated a 
role for laminins containing this chain during 
development of the GBM, septations in the 
lung, hair follicles, dental epithelium, and 
submandibular gland morphogenesis. 
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Description References 
LAMAS is found in mature epithelium, mature [70,109] 
endothelium, smooth muscle, neuromuscular 
junction, GBM, and the CNS and retina. 

During glomerulogenesis, there is a transition 

in laminin gene expression from laminin-1 

and laminin-10 to laminin-11 as the GBM 

is maturing. In the GBM, laminin-11 forms 

an intricate meshwork with collagen type IV, 
perlecan, and nidogen. 

In the lungs, LAMAS is expressed by alveolar [96] 
epithelial cells. 

In collagen a3 (IV) knockout mice—the [81,112] 
Alport model—laminin a5 was highly 

present in the thickened GBM and led to 

an abnormal permeability. It seems that 

the LAMA5 LG1-2 tandem domain plays 

an essential role during development and 

harbors the majority of the functionality of 

the LAMA5 LG domain. Also, the LAMA5 

LG3-5 tandem serves as a novel determinant 
required for the kidney’s glomerular filtration 

barrier to plasma protein. 

The laminin a chains are different from all [7] 
other laminin chains because they contain 

a G domain. The G domain consists of five 
self-folding modules called laminin G modules 
(LG1-5). All a chains also contain a long 

coiled-coil domain that mediates heterotrimer 
assembly. Similar domains are found in the B 

and y chains. The laminin a1, a2, a3B, and a5 

chains each contain a short arm that consists 

of alternating globular domains and rod- 

like domains composed of laminin EGF-like 

repeats (see Fig. 29.1). 

LAMAS is the largest laminin chain (see Fig. (112) 
29.1). Although it contains the same types of 

domains as the other laminin a chains, it has 

been shown that some of the specific LAMA5 
domains are essential for development. 

Through a series of knock-in studies in Lama5- 
deficient embryos, it was shown that LG1—2 

were vital for development and that the LG3-5 
domains were necessary for correct assembly 

of the glomerular filtration barrier of the 

kidney. 

LAMAS is cleaved by MT1-MMP. [13] 
Thought to be part of basement membranes in 

the tissues mentioned above and functions as 

a stabilizing agent. 

Integrins 4361, «661, 0664, and Lutheran [12,114] 
glycoprotein bind to LAMA5. 

LAMA5 forms heterotrimers with laminin ß 

and y chains (see Table 29.1). 
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Laminin Subunita5 Description References 
Known central Part of the GBM where it is thought to stabilize [115] 
function and regulate permeability. 

Animal models with Due to the lethality of total deficiency of HA2] 
protein affected laminin a5 chains, a chimeric animal model 


was created with the traditional LAMA5 G 
domain being swapped for that of LAMA1. 
Through various knockin experiments, it was 
shown that the LG1-2 domains are vital for 
development and that the LG3—5 domains 
are necessary for correct assembly of the 
glomerular filtration barrier of the kidney. 
A podocyte-specific LAMA5 deficiency was [116] 
created by Cre-lox and tetO. This led to 
varying degrees of proteinuria and rates of 
progression to nephrotic syndrome. Restoration 
of podocyte-specific LAMA5 expression 
led to restoration of normal function. The 
specific ablation of LAMAS in podocytes 
with concomitant development of proteinuria 
suggests that LAMA5 expression by podocytes 
is necessary for maintenance of GBM integrity. 
A study in mice showed that a hypomorphic MZ 
mutation caused polycystic kidney disease. 
In a mouse model of Alport syndrome, [81] 
knockout of collagen a3 (IV) led to 
upregulation of LAMA5 in the GBM. 
Biomarkers If the upregulation of LAMA5 in the mouse [81] 
model for Alport syndrome is also true 
for humans, LAMA5 could be a potential 
biomarker of Alport syndrome. 
Also, because MT1-MMP is able to cleave {113] 
LAMAS, and prostate cancer cells have been 
shown to express MT1-MMP, fragments of 
LAMAS could be potential biomarkers in 
prostate cancer. 


LAMB1 
Laminin Subunit B1 Description References 
Gene name and LAMB1 Gene ID: 3912 
number UniProt: PO7942 
Chromosome: 7q22 
Mutations with Mutations in LAMB 1 cause cobblestone brain [118] 


diseases in humans malformation without muscular or ocular 
abnormalities. The disease is an autosomal 
recessive brain malformation characterized 
by cobblestone changes in the cortex that 
are more severe in the posterior region, 
and subcortical band heterotopias. Affected 
individuals have hydrocephalus, seizures, and 
severely delayed psychomotor development. 


Laminin Subunit 81 
Null mutation in 
mice 


Tissue distribution in 
healthy states 


Tissue distribution 
in pathologically 
affected states 


Special domains 


Special neoepitopes 
Protein structure and 
function 


Binding proteins 


Known central 
function 


Animal models with 
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Biomarkers 


LAMB2 


Laminin Subunit 62 
Gene name and 
number 


Mutations with 
diseases in humans 
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Description 

Knockout of this gene results in early death 
of mice at around E5.5. The lethality is 
due to absence of basement membrane 
formation. 

Developing and mature epithelium 
(laminins-1, -6, and -10), myogenic 

tissue, peripheral nerves (laminins-2 and -8) 
endothelium (laminins-8 and -10), 

smooth muscle (laminin-10), and adipose 
tissue. 

It has been demonstrated that LAMB1 and 
integrin a2 expressions are elevated in the 
anterior temporal neocortex tissue from 
patients with intractable epilepsy. 

The LAMB1 and LAMB2 chains share similar 
domains. The LAMB3 chain, however, 
resembles a truncated version and does not 
contain a laminin IV type B domain 

(see Fig. 29.1). 

None found in the literature. 

The three-dimensional structure of LAMB1 has 
not been identified. 

As null mutation of Lamb7 in mice 

leads to early death. The LAMB1 chain 

plays an important role during early 
development. 

LAMB1 forms heterotrimers with laminin a 
and y chains, thereby forming laminins-1, -2, 
-6, -8, -10, and -12 (see Table 29.1). 

LAMB1 has also been predicted to interact 
with collagen type XVII and integrins a1, a6, 
and a7. 

LAMB1 is an integral part of most basement 
membranes of the body. 

Other functions: See above. 

None found in the literature. 


None found in the literature. 


Description 

LAMB2 

UniProt: P55268 

Chromosome: 3p21 

LAMB2 deficiency causes Pierson syndrome, a 
congenital nephrosis with mesangial sclerosis 
and distinct eye abnormalities. Hypomorphic 
mutations in LAMB2 cause milder phenotypes 
of Pierson syndrome. 
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Laminin Subunit B2 
Null mutation in 
mice 


Tissue distribution in 
healthy states 


Tissue distribution 
in pathologically 
affected states 


Special domains 


Special neoepitopes 
Protein structure and 
function 

Binding proteins 


Known central 
function 


Animal models with 
protein affected 


Biomarkers 


Description 

A Lamb2 knockout does not affect the 
development of the GBM because LAMB1 
seems to be able to partially replace 

LAMB2. However, there are serious functional 
consequences of the LAMB2 deficiency that 
lead to defects in the neuromuscular 
junctions, severe proteinuria, and retinal 
pathology. It thus seems that LAMB2 

is indispensable for the integrity of 

the GBM. 

Myotendinous junctions (laminin-3), 
neuromuscular junction (laminins-4, -9, 

and -11), epidermis (laminin-7), endothelium 
(laminins-9 and -11), smooth muscle 
(laminins-9 and -11), mesangial matrix of 
the glomerulus (laminins-4 and -9), GBM 
(laminin-11), the CNS and retina 
(laminins-13, -14, and -15). 

Overexpression of the LAMB2 chain 

is associated with increased basement 
membrane thickness and is possibly related to 
spermatogenic dysfunction. 

The LAMB1 and LAMB2 chains share similar 
domains. In comparison, the LAMB3 chain 
resembles a truncated version and does not 
contain a laminin IV type B domain 

(see Fig. 29.1). 

None found in the literature. 

See “Special domains.” 


[123] 


a3ß1 and a7X2ß1 integrins bind more avidly 
to LAMB2 containing laminins than LAMB1 
containing laminins. 

LAMB2 also interacts with nidogen. 

LAMB2 forms heterotrimers with laminin a and 
y chains (see Table 29.1). 

Components of laminin heterotrimers in 
basement membrane (see “Tissue distribution in 
healthy states”). 

LAMB2 deficiency causes the glomerular 
filtration barrier to fail, leading to severe 
proteinuria. 

Mutations of Lamb2 have shown that it 
regulates formation of motor nerve terminals. 
None found in the literature. 
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Description 

LAMB3 

UniProt: Q13751 

Chromosome: 1q32 

Epidermolysis bullosa, junctional 
epidermolysis bullosa, Herlitz-type generalized 
atrophic benign epidermolysis bullosa 

Not viable as it causes perinatal death. 
Deficiency was achieved through 
spontaneous insertion of an intracisternal-A 
particle in the gene encoding LAMB3. 
Homozygous mice survived for less than 

30h after birth and had very fragile skin, 
which resulted in dislodgement of the entire 
epidermal layer of the paws, limbs, tail tips as 
well as causing blisters of the skin. The model 
resembles junctional epidermolysis bullosa. 
Found at the dermal-epidermal junction 


Laminin-5 (a3ß3y2) staining is attenuated 

in patients with the mutations that cause 
junctional epidermolysis bullosa. 
Uncoordinated production of laminin-5 
occurs in lungs of allergic asthma patients, 
which might play a role in the poor epithelial 
cell anchorage in these patients. 

In biliary cancer LAMB3; LAMC2, which 
constitutes two of three chains of laminin-5; 
and MMP-7 are overexpressed. 

None found in the literature. 

LAMB3 is the smallest of the laminin B chains 
(see Fig. 29.1). 

LAMB3 is cleaved by MT1-MMP. 

Cell surface collagen XVII can interact with 
laminin-5 and together they participate in the 
adherence of a cell to the ECM. 

ECM1 interacts with fibulin-3 and the LAMB3 
chain of laminin-2. 

LAMB3 forms heterotrimers with laminin a 
and y chains (see Table 29.1). 

See “Protein structure and function.” 


A mouse model for junctional epidermolysis 
bullosa has been developed. 
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Laminin Subunit B3 


Biomarkers 


LAMC1 


Laminin 
Subunit y1 


Gene name and 
number 


Mutations with 
diseases in 
humans 

Null mutation in 
mice 

Tissue 
distribution in 
healthy states 
Tissue 
distribution in 
pathologically 
affected states 


Special domains 


Special 
neoepitopes 
Protein structure 
and function 


Binding proteins 


Description 

Laminin-5 (a3B3y2) is a biomarker of 
invasiveness in cervical adenocarcinoma. 
Because LAMB3 has been shown to be a 
prometastatic gene during lung cancer, it 
could have potential as a biomarker to detect 
metastatic lung cancer. 

MT1-MMP has been shown to be upregulated 
in prostate cancer. Because it is also able to 
cleave LAMB3 in vitro and thereby cause cell 
migration, it could be interesting to assess the 
potential of this laminin chain as a biomarker. 


Description 

LAMC1 

UniProt: P11047 

Chromosome: 1q31 

A polymorphism in the promoter of LAMC7 may 
increase susceptibility to early-onset pelvic organ 
prolapse 

Knockout results in embryonic lethality due to 
failure of endoderm differentiation. 

LAMC1 is present in most basement membranes 
(ie, in laminins-1, -2, -3, -4, -6, -7, -8, -9, -10, -11) 
(see Table 29.1). 

LAMC1 expression seems to be associated with 
meningioma grades and could play a role in 
enhancing tumor invasion. 

In samples of serous tubal intraepithelial carcinoma 
an altered staining pattern and increased 
immunoreactivity of LAMC1 were seen. 

Enhanced and differential distribution of LAMA1 
and LAMC1 distinctly associate these two laminins 
with Alzheimer disease. 

The neurite outgrowth domain of LAMC1 
accumulates in the floor plate region of the 
notochord and in glial fibrillary acidic protein- 
immunoreactive glial fibers of the embryonic spinal 
cord. 

None found in the literature. 


The nanostructure of the LAMC1 short arm has 
been determined. In solution, it adapts an extended 
conformation. 

Nidogens-1 and -2 

LAMC1 forms heterotrimers with laminin a and 6 
chains (see Table 29.1). 
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Description 

LAMC1 is found in most basement membranes. 
The interaction of LAMC1 with nidogen seems to be 
necessary for the development of lung and kidney. 
Specific ablation of the nidogen binding site in the 
laminin y1 chain interferes with kidney and lung 
development. The deletion of the nidogen binding 
site also leads to cortical dysplasia. Cell type- 
specific deficiency of LAMC1 in Schwann cells 
lead to peripheral nerve defects. 

Because of the association with meningioma grade, 
recurrence and progression-free survival LAMC1 
might be a biomarker for this malignancy. 

Because alterations in staining patterns and an 
increased immunoreactivity of LAMC1 are seen in 
serous tubal intraepithelial carcinoma, it could be 
an interesting target for biomarker development. 
Because LAMC1, together with LAMA1, are 
distinctly associated with Alzheimer disease, they 
might prove to be biomarkers. 


Description 

Laminin y2 chain (LAMC2) 

UniProt: Q13753 

Chromosome: 1q25-31 

Mutations in all of the chains found in laminin-5 
(a3B3y2) can lead to epidermolysis bullosa, 
junctional epidermolysis bullosa, or Herlitz-type 
epidermolysis bullosa. 

Null mutations cause perinatal death. 

Targeted inactivation of Lamc2 produces a model 
similar to human junctional epidermolysis bullosa 
and is characterized by blister formation of the 
skin. 

Abnormal tracheal hemidesmosomes were also 
observed in Lamc2 null mice. 

In the epidermis, it is mainly found in the basement 
membrane within laminin-5 (see Table 29.1). 

In normal esophageal tissues, LAMC2 is expressed 
in the basement membrane. 

Laminin-5, containing LAMC2, has been found 

to be associated with invasiveness of cervical 
adenocarcinomas. 

LAMC2 has also been shown to be increased in 
the circulation of patients with early-phase acute 
lung injury. 
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LAMC3 


Laminin 
Subunit y3 


Gene name and 
number 


Mutations with 
diseases in 
humans 

Null mutation in 
mice 


Description 

LAMC2 may play a role in the progression of 
esophageal squamous cell carcinoma, and 
simultaneous expression with secreted protein 
acidic and rich in cysteine (SPARC) is correlated 
with poorer prognosis. 

Uncoordinated expression of laminin-5 is found 
in the airway epithelium of patients with allergic 
asthma. 

None found in the literature. 

LAMC2 has been shown to be cleaved by MMP-2 
between residues 586 and 587, exposing a putative 
cryptic promigratory site that triggers cell motility. 
LAMC2 is cleaved by MT1-MMP, causing cell 
migration. 

Bone morphogenetic protein-1 can cleave 
LAMC2. 

Part of the laminin heterotrimer (see Table 29.1). 
Part of the basement membrane of epidermis and 
esophageal tissues (see Laminin-5 section). 
Syndecan-1 

LAMC2 forms heterotrimers with laminin a and B 
chains (see Table 29.1). 

Part of the basement membrane. 


Targeted inactivation and a hypomorphic mutation 
of Lamc2 produce models resembling human 
junctional epidermolysis bullosa. 

Laminin-5 is a biomarker of invasiveness in 
cervical adenocarcinoma. Because LAMC2 

is a part of laminin-5, it could be a potential 
biomarker for this type of cancer. 

LAMC2 is also a promising biomarker for 
pancreatic cancer and bladder cancer. 

LAMC2 is a biomarker of acute lung injury. 


Description 

LAMC3 

UniProt: Q9Y6N6 

Chromosome: 9q34.12 

Recessive LAMC3 mutations cause malformations 
during the development of the occipital cortex 


LAMC3 null mice have no apparent phenotype. 
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Laminin 
Subunit y3 Description References 
Tissue In situ hybridization of the midfetal human [159] 
distribution in brain results in robust LAMC3 expression in the 
healthy states cortical plate and within the germinal zones of the 
neocortical wall, including the ventricular zone, 
subventricular zone, the outer subventricular zone, 
and the intermediate zone. In contrast, LAMC3 
expression in the developing mouse brain is limited 
to the vasculature and meninges. 
Tissue A homozygous two-base pair deletion was identified [159] 
distribution in LAMC3, which led to a premature termination 
pathologically codon. The premature termination resulted in 
affected states malformation of occipital cortical development. 
It is not known how this affects the distribution of 
LAMC3 in pathologically affected states. 
Special domains Laminin isoforms containing LAMC3 are unable to [161] 
bind to integrins due to the absence of the glutamic 
acid residue conserved in the C-terminal regions of 
LAMC1 and LAMC2. 
Special None found in the literature. 
neoepitopes 
Protein structure Part of laminins-12, -13, -14, and -15, which are 
and function found in surface of human chorionic placenta villi 
and basement membrane of the CNS and the retina 
(see Table 29.1). 
Binding proteins | Nidogens-1 and -2. [162] 
In contrast to LAMC1 and LAMC2, LAMC3 is not [161] 
able to bind integrins because of absence of a 
glutamic acid residue required for this interaction. 
LAMC3 forms heterotrimers with laminin a and B 
chains (see Table 29.1). 
Known central Has only been found in laminins of the CNS and 
function surface of ciliated cells. It is thus believed that 
LAMC3 may play an important role in the formation 
and integrity of the basement membrane of the CNS. 
Animals models Mice homozygous null for Lamb2 and Lamc3 [163] 


with protein exhibit cortical laminar disorganization and exhibit 

affected hallmarks of human cobblestone lissencephaly (type 
Il, nonclassical). 

Biomarkers None found in the literature. 


E, embryonic day; EGF, epidermal growth factor; G domain, large 100-kDa, C-terminal globular 
domain; GBM, glomerular basement membrane; LG, laminin G; MT1-MMP, membrane type 1 
matrix metalloprotease; MMP, matrix metalloprotease; ECM, extracellular matrix. 
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SUMMARY 

Elastin is a key extracellular matrix (ECM) protein that provides resilience and elasticity 
to tissues and organs. Elastin is roughly 1000 times more flexible than collagens; thus, 
the main function of elastin is the elasticity of tissues. It is the dominant protein in exten- 
sible tissues and is primarily present in the lungs, aorta, and skin. Mutations in the elastin 
gene may lead to diseases such as Williams—Beuren syndrome, cutis laxa, and supra- 
valvular aortic stenosis (SVAS). The precursor of elastin is tropoelastin. Tropoelastin is 
derived from fibroblasts, smooth muscle cells, chondrocytes, or endothelial cells before it 
is processed to elastin by cleavage of its signal peptide. Elastin monomers are crosslinked 
during the formation of desmosine molecules. Several important and well-described bio- 
markers of elastin degradation are available. 


Elastin is a key ECM protein that provides resilience and elasticity to tissues and 
organs. It is one of the dominant proteins in extensible tissues and is primarily 
present in the lungs, aorta, and skin [1]. Elastin constitutes the majority of elastic 
fibers. During fiber formation, it binds to other structural ECM molecules such 
as fibulins, and fibrillins with an e-(y-glutamyl) lysine bond [2—4]. The binding is 
facilitated by transglutaminase (Tgase). 

The precursor of elastin is tropoelastin. Tropoelastin is derived from fibro- 
blasts, smooth muscle cells, chondrocytes, or endothelial cells before it is pro- 
cessed to elastin by cleavage of its signal peptide [5]. Elastin monomers can be 
crosslinked during the formation of desmosine molecules that are formed by the 
crosslinking of three allysine and one lysine residue. The process is facilitated 
by lysyl oxidase that oxidizes elastin lysine residues into allysine [6]. Elastin 
contains several hydrophobic domains that are dominated by the residues valine, 
glycine, and proline (UniProt: P15502 [7]). In addition to the hydrophobic 
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domains elastin has several special domains such as multiple hydroxyproline 
and allysine sites, a polyalanine domain (residues 97—102), and alanine-rich 
domains (residues 236—742) as well as the rarely expressed exon 26A [7-10]. 
Its crosslinks render elastin remarkably hydrophobic and stable, with a long 
half-life of up to 40 years [11]. 

The roles of elastin in the development of the lungs in particular have been 
thoroughly investigated (Table 30.1). Studies in mice have shown that elastin 
null mutations result in fewer, larger air sacs in the lungs and severe branch- 
ing defects [12]. Elastin-deficient mice are also more susceptible to smoking- 
induced emphysema than mice with normal elastin levels [13]. Elastin is also 
responsible for the elasticity of the conduit arteries [14]. As the body ages, the 
elastin contents of the arteries declines, thereby increasing arterial stiffness and 
thus the pulse pressure [14]. This has also been demonstrated in animal models 
with induced hypertension where the elastin-to-collagen ratio decreases, which 
results in stiffer arteries [15,16]. For comparison, it is estimated that collagens 
are more than 1000 times stiffer than elastin [17]. Last, mutations in the elastin 
gene may lead to diseases such as Williams—Beuren syndrome, cutis laxa, and 
SVAS [18-20]. 

Elastin synthesis peaks during the embryonic phase and early child- 
hood [21]. The lower elastin turnover occurring during adulthood enables 
the detection and quantification of abnormal elastin degradation or distribu- 
tion [22,23]. Elevated levels of elastin fragments have been found in adult 
patients diagnosed with chronic obstructive pulmonary disease (COPD), 
idiopathic pulmonary fibrosis, lung cancer, or cardiovascular diseases 
[22,24—26]. Furthermore, increased elastin degradation rates, as demon- 
strated by the presence of increasing matrix metalloproteinase—and human 
neutrophil elastase—mediated elastin fragmentation, may indicate acute 
exacerbations in COPD patients [27]. 

Ordered distribution of elastin depends on controlled elastin crosslinking 
activity. Circulating elastin fragments containing desmosine can be quanti- 
fied with a wide range of techniques including enzyme-linked immunosorbent 
assay, high-performance liquid chromatography—mass spectrometry, and cap- 
illary electrophoresis [28]. Levels of these desmosine-containing fragments 
may be used to monitor fibrosis in interventional studies, although more 
research is required to confirm the utility of this potential marker. Elastin 
fragments are not merely a consequence of tissue destruction; they may also 
act as pathological mediators. In emphysema, elastin fragments stimulate an 
immune response in lung tissues, leading to the upregulation of cytokines 
with a further increase in protease expression and subsequently to progressive 
destruction of the lung ECM [29]. During skin repair, elastin signaling attenu- 
ates wound contraction and stimulates protease production, cell migration, 
and proliferation as well as matrix synthesis [30]. Last, peptides expressed 
from the rarely transcribed or translated exon 26A are believed to induce cell 
migration and to have chemotactic properties [10]. Future studies may reveal 
additional regulatory properties of elastin. 


TABLE 30.1 Elastin 
Elastin 


Gene name and number 


Mutations with diseases in 
humans 


Null mutation in mice 

Tissue distribution in healthy states 
Tissue distribution in 
pathologically affected states 


Special domains 


Special neoepitopes 


Protein structure and function 


Binding proteins 
Known central function 
Animals models with protein 


affected 


Biomarkers 


Description 
Gene name ELN, location 7q11.1-q21.1 


Williams—Beuren syndrome, cutis laxa, SVAS 


SVAS. Impaired lung development. Mice with elastin deficiencies are more prone to 
emphysema. Elastin knockout (ELN-/—) mice die shortly after birth. 


Primarily in the aorta, skin, and lungs 


Proteolytic degradation and chemical alterations of elastin have been observed in 
pathologies affecting the lungs and arteries 


Multiple hydroxyproline and allysine sites. Polyalanine domain (residues 97—102), 
alanine-rich domain (residues 236-742). Hydrophobic regions. Exon 26A. 


ELM and ELM2 are elastin fragments generated by MMP-9 and MMP-12 cleavage of 
elastin. ELM7 and EL-NE are elastin fragments generated by MMP-7 and HNE cleavage of 
elastin, respectively. Upregulation of elastin fragments in the circulation can be coupled 
with pulmonary or arterial disorders. Peptides derived from exon 26A. 


Cleavage of tropoelastin generates elastin monomers. Elastin and tropoelastin consist of 
hydrophobic and hydrophilic domains. The hydrophobic domains of elastin and crosslinking 
of elastin monomers render elastic fibers insoluble and stable. Elastin ensures tissue elasticity. 


Binds mainly to other elastin monomers, fibrillins, and fibulins. Binding and crosslinking is 
facilitated by LOX and Tgase. 


Structural protein that provides elasticity to tissue. Fragments of elastin may stimulate the 
immune system. 


Elastin-deficient mice are used to study lung development and susceptibility to 
emphysema caused by smoking. 


ELM, ELM2, ELM7, EL-NE, desmosine 
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SUMMARY 

A central event in tissue remodeling is expression and activation of extracellular matrix 
(ECM)-degrading proteinases and collagenases, resulting in the release of small protein 
fragments called neoepitopes into the circulation. These neoepitopes can and are being 
targeted as biochemical markers of “end products of tissue destruction.” 

Neoepitopes can include different posttranslational modifications (PTMs) such as 
citrullination, nitrosylation, glycosylation, and isomerization. Each modification is a 
result from specific and local physiological processes. Identification of PTMs on neoepi- 
topes may deliver unique disease-specific biomarkers and have been identified in mus- 
culoskeletal, cardiovascular, fibrotic, and neurodegenerative diseases. The neoepitope 
biomarkers relying on analytes that are modified by multiple PTMs may become optimal 
tools that meet the burden of disease, investigatory, prognostic, efficacy of intervention 
and diagnosis (BIPED) biochemical marker “usefulness” criteria. 
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In this chapter, the use of structural biomarkers is discussed, with emphasis on various 
PTMs in different disease indications. The use of the BIPED criteria is likewise discussed, 
as well as the importance of understanding the effect of the matrix in which the biomarker 
is measured, such as serum, urine, and other body fluids, on interpretation of results. 


ECM remodeling, in which old or damaged proteins are degraded and 
replaced by new, intact proteins, is a key process in tissue homeostasis. Spe- 
cific proteolytic activities are a prerequisite for a range of cellular functions 
and interactions within the ECM during this remodeling. These specific activi- 
ties are balanced to promote tissue turnover and renewal. In disease states, this 
repair—response balance is disrupted, leading to excessive tissue turnover in 
which the ECM proteins are replaced by new and potentially different ECM 
proteins. An imbalance in ECM remodeling results in the release of small ECM 
protein fragments into the systemic circulation. If detected, these fragments 
may be used as molecular biochemical markers of various pathologies. 

Several biochemical markers (biomarkers) measuring ECM remodeling 
have become available in recent decades, and they may provide tools for inves- 
tigation of the ECM imbalance observed in different diseases as well as pro- 
viding in vitro diagnostic tools for the assessment of treatment efficacy and 
prognosis of patients diagnosed with chronic diseases such as osteoporosis (OP) 
[1,2], rheumatoid arthritis (RA), chronic obstructive pulmonary disease, and 
cancers [3]. Many of the biomarkers have been categorized according to the 
BIPED classification developed by the National Institutes of Health—industry 
partnership funded by the Osteoarthritis Biochemical Markers Network [4—7]. 
The Food and Drug Administration critical path initiative launched in 2004 fur- 
ther emphasized the need for translational science and the use of biochemical 
markers during drug discovery and development [3]. 

Below, we describe the biomarkers that are available to measure ECM 
turnover. 


THE CONVERGENCE OF DIFFERENT PATHWAYS LEADS 
TO ALTERED TURNOVER OF THE ECM 


A common denominator of many chronic and inflammatory diseases is elevated 
levels of proinflammatory cytokines, including, but not limited to, the inter- 
leukin (IL)-1, IL-6, IL-17, and IL-18 and tumor necrosis factor (TNF)-a [8]. 
The combined proinflammatory burden is associated with substantial cell sur- 
face receptor activation in different tissues, resulting in substantial alterations 
in cellular signaling mechanisms involving, in particular, nonreceptor tyrosine 
kinases such as Syk, Jak, Src, and PI3K [9]. This leads to increased tissue turn- 
over with an imbalance between tissue formation and degradation, resulting in 
massive connective tissue destruction and failure [10]. Such antiinflammatory 
treatments have focused on reducing or ablating the pathological activity in 
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tissues, either by directly targeting the pathological cytokines (as is the case 
with antibodies against cytokines or their receptors), or by reducing the activ- 
ity of their downstream signaling mediators (so-called kinase inhibitors). The 
kinase signaling pathways are prime targets for pharmaceutical intervention and 
have proven successful in treating inflammation [11]. 

One example of successful targeting of the kinase signaling pathway is in 
RA, a chronic autoimmune disease characterized by inflammation in multiple 
synovial joints leading to tissue destruction and joint failure. In principle, the 
inflammatory burden of disease results in the activation of transmembrane 
cytokine receptors leading to the activation of intracellular signaling pathways 
including JNK, ITAM, PLC, NFAT, AKT, STAT, ERK, SYK, LYN, FYN, and 
JAK [11-13]. Activation of the signaling pathways ultimately leads to the 
expression of powerful proteolytic and collagenolytic enzymes, in particular 
the matrix metalloproteinases (MMPs) and aggrecanases [14,15] (Fig. 31.1). 
The proteolytic enzymes are secreted partly by cells of the affected tissues and 
partly by elevated levels of invading inflammatory cells such as macrophages, 
T cells, and neutrophils. This increased proteolytic activity destroys the ECM, 
resulting in the release of ECM fragments that are cleared from the joint and 
released into the circulation as biomarkers [16-18]. Furthermore, the inflam- 
matory burden activates the fibroblasts to become myofibroblasts that express 
novel ECM proteins. In this process, proteins fragments are also released into 
the circulation from different types of collagen, many of which contain propep- 
tides that are posttranslationally cleaved as part of tissue incorporation [19]. In 
RA, MMP-mediated degradation of ECM collagen proteins has resulted in the 


Pro-inflammatory 
cytokines 


fa 
VA 


FIGURE 31.1 Neoepitopes of the extracellular matrix (ECM) are the end product of tissue 
destruction. Damage to the tissue increases the level of proinflammatory cytokines that initiate a 
signaling cascade leading to an upregulation of proteases such as matrix metalloproteinases that 
degrade the ECM, resulting in the release of neoepitopes that can be used as biomarkers. 
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development of specific biomarkers: CIM and ICTP; C2M, C2C, C-terminal 
telopeptide of type II collagen (CTX-I]), and C3M, measuring degradation of 
types I, II, and II collagen, respectively [20-23]. Other noteworthy biomarkers 
of the joint are: CTX-I, used to measure bone resorption mediated by cathepsin 
K activity [24]; ARGS or AGNx! and FFGV, measuring aggrecan degraded 
by aggrecanases or MMPs [25,26]; and PIINP, Pro-C2, type HA procollagen 
amino-terminal propeptide (PIIANP), formation biomarkers of type II collagen 
degradation [27-31]. As a tissue-specific example, type II collagen is almost 
exclusively expressed in cartilage. C2M, CTX-II, and C2C are consequently 
markers of cartilage degradation; hence, they are used as a tool to track changes 
in cartilage turnover in rheumatological conditions [3]. 

In RA and other chronic inflammatory diseases a range of cytokines may 
activate different cellular pathways, which end in the production of the “usual 
suspects” of a battery of proteases inducing tissue turnover. Although the cyto- 
kine signals may be many, and the intracellular signaling pathways diverse, the 
end products that are specific for given tissues—the small protein fragments 
that are released from the tissue due to excessive protease activity—may be 
considered the result of the convergence of pathways. 


DIFFERENT CLASSES OF ECM BIOMARKERS 


ECM biomarkers can be divided into three classes (Fig. 31.2). The first class (A) is 
often based on total protein measurement, and different prerequisites are needed for 
each assessment (Fig. 31.2A). The second class (B) assesses the presence of a single 
specific ECM protein fragment or PTMs of an ECM protein. The third class (C) of 
biomarkers enables measurement of multiple PTMs (eg, those arising from frag- 
mentation together with oxidation; see description below). Because we now know 
that collagens and other ECM proteins are degraded before being released into the 
circulation, the assay developed to measure the proteins should be designed so that 
a most of the original protein components can be detected. In an enzyme-linked 
immunosorbent assay (ELISA), this could be achieved by using two antibodies 
targeting the N and C termini of the protein. Keeping in mind the normal turnover of 
ECM, class A biomarkers seem more appropriate for measurement of cytokines and 
growth factors that exist in their latent form in the circulation. Importantly, class A 
biomarkers can be encapsulated by macroglobulins, chaperones, or other transport 
or encapsulating proteins, which can mask the epitope of interest [32]. 

Class B biomarkers (Fig. 31.2B) of the ECM are probably used more often 
to detect protein fragments in the circulation than class A. However, several 
epitopes per protein exist. Class B can detect neoepitopes generated as part 
of the ECM turnover, modifications of the ECM proteins generated by path- 
ological processes, or a combination of the two. The different subclasses of 
biomarkers can be detected by different proteomic approaches such as mass 
spectrometry and ELISA. ELISAs can be in both competitive and sandwich for- 
mats, because only one antibody is needed to detect a specific peptide sequence 
with a PTM on the antigen. The biomarkers targeting class B epitopes are often 
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FIGURE 31.2 There are different types of extracellular matrix (ECM) biomarkers. (A) 
Measurement of full-size proteins often requires antibodies (Abs) with a broad range, eg, Ab1 tar- 
geting the N-terminal domain and Ab2 targeting the C-terminal domain. (B) Biomarkers of the 
ECM often target fragments of the molecule (Ab3). These fragments can be modified by different 
processes, such as oxidation, which then can be targeted as specific biomarkers (Ab4). A posttrans- 
lational modification process may destroy an epitope measuring the unmodified fragment (crossed 
Ab3). Also, some proteins are encapsulated by chaperones, masking the epitope of interest. (C) 
Isomerized type I collagen is the major collagen of the bone. It is a crosslinked triple helix that is 
degraded by the osteoclast enzyme cathepsin K. 


greatly informative for prognosis, but they are seldom organ or tissue specific. 
An example is the biomarker VICM, a citrullinated and MMP-generated frag- 
ment of vimentin and a biomarker of macrophage activity [33,34]. 

Class C biomarkers (Fig. 31.2C) are tissue and potentially organ specific. 
Specificity may be achieved by a combination of different PTM processes that 
are pathospecific on proteins that are more abundant in the ECM in the tissue of 
interest. The identification of such biomarkers is cumbersome, and only a few 
examples of ECM-specific biomarkers are available. One example is B-CTX-I, 
which is described later in this chapter. 

This separation of biomarkers into different classes is important because dif- 
ferent technologies provide different levels of accuracy and precision. Numerous 
novel PTM processes on ECM proteins are discovered by proteomic approaches 
such as oxidation, citrullination, proteolytic fragmentation, glycosylation, and 
nitrosylation [35,36]. This suggests additional applied research is required to 
understand the exact analyte measured, because different PTMs may have dif- 
ferent pathological meanings and measurement of different epitopes provides 
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completely opposite information. Thus, different PTMs may, in part, explain why 
different versions of analytical methods thought to be measuring the same pro- 
tein often lead to divergent results. A well-characterized example of this issue is 
measurement of type I collagen and its degradation fragments in postmenopausal 
OP [2,37]. Measurement of total type I collagen in serum would suggest that 
postmenopausal women have more type I collagen than age-matched individuals 
[37]. However, whether the increased levels are the consequence of degradation, 
or reduced formation of new type I collagen, is not revealed by a class A bio- 
marker. Propeptides of proteins are released during protein synthesis and can be 
measured as surrogates for tissue formation. The N-terminal propeptide of type 
I collagen (PINP) is released as the protein is incorporated into the ECM; con- 
sequently, PINP in the circulation is used to measure both bone formation and 
fibrogenesis [38]. Meanwhile, the degradation of type I collagen is mediated by 
several proteinases, which yields several different degradation products of type I 
collagen. CTX-I is widely used in ELISA for the detection of cathepsin K—medi- 
ated destruction of type I collagen. Because the combination of type I collagen 
and cathepsin K is mainly observed in bone, CTX-I is considered a unique marker 
of bone resorption and is widely used as such [39]. Furthermore, at least two 
different versions of CTX-I exist, an a and a B version. Both measure the same 
cathepsin K—generated cleavages, but the B version measures a }-isomerized (oxi- 
dized) form of the fragment, whereas the a version measures the nonisomerized 
form. This can be translated into B-CTX-I, reflecting resorption of old bone as 
seen in OP and into a-CTX-I, reflecting the resorption of young bone, as seen in 
subchondral bone remodeling in osteoarthritis (OA) and bone metastasis [40,41]. 
Another biomarker of type I collagen destruction is type I collagen—derived cross- 
linked carboxy-terminal telopeptide (ICTP), which is a triple-crosslinked car- 
boxyl-terminal telopeptide of type I collagen generated by MMP and destroyed 
by cathepsin K [24]. ICTP is mainly generated in inflamed connective tissue. 
Last, the C1M is released from the helical domain of type I collagen, also by the 
action MMPs [22]. The different fragments of type I collagen may provide differ- 
ent information, such as the occurrence of opposite processes of bone formation 
and bone resorption, by the different measurements of the same molecule. 


PTMs OF ECM PROTEINS 


Identification of tissue-, protein-, and disease-specific PTMs is needed for their 
optimal application as biomarkers to characterize pathogenesis and monitor 
for a disease. Because only a few proteins can be considered tissue specific, 
the combination of a tissue- or disease-specific protein with related specific 
proteases or factors offers the possibility to increase the sensitivity and speci- 
ficity of an assay analyzing specific combinations of PTMs. Well-designed 
combinations of PTMs that are disease specific may provide a new generation 
of biomarkers. In Table 31.1, different potential combinations of PTMs in a 
single assay are proposed to be targeted in different pathologies. 
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TABLE 31.1 Composition of Synovial Fluid Relative to Other Human 
Bodily Fluids [115-117] 


Synovial 
Test fluid Plasma Sweat Saliva Semen 
pH 7.4 7.35-7.45 4-6.8 6.48 Z 19 
Specific 1.008- 1.0278 1.001- 1.007 1.028 
gravity 1.015 1.008 


Electrolytes/mEq L7! 


Potassium 4.0 375550 4.3-14.2 pil Biles) 
Sodium 136.1 135-147 0-104 143 A 
Calcium 2.3-4.7 8-11 0.2-6.0 3 12.4 
Magnesium 1525 0.03—4 0.6 TS 
Bicarbonate 19.3-30.6 95-105 6? 24 

Chloride 107.1 22-28 34.3 U7 42.8 


Proteins/mg dL- 


Total 11.72 e6ll-" 6-8 g dL"! Beil 386P 4.5gdL"| 
Albumin 55-70% 50-65% 

a-1- 6-8% 3-5% 

Globulin 

a2- 5-7% 7-13% 

Globulin 

B-Globulin 8-10% 8-14% 

y-Globulin 10-14% 12-22% 


Hyaluronic 0.3- 
acid 0.4gdL7! 


Metabolites/mg dL- 


Amino 47.6 40 1.26gdL"! 
acids 

Glucose 70-110 70-110 3.0 11 224¢ 

Uric acid 2-8 2-8 26-1224 20 72 

Lipids, total 25-500° 188 


alncreases under salivary stimulation. 
Primary a-amylase, with some lysosomes. 
Fructose caption. 

4Not present in eccrine secretions. 
Cholesterol. 
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PTMs are the result of modifications to the composition or structure of 
proteins that expose new epitopes (ie, biomarker targets) called neoepitopes. 
Important processes creating PTMs are proteolytic cleavage (fragmentation); 
hydroxylation of prolines, resulting in hydroxyprolines; cleavage of procollagen 
peptides; crosslinking both intramolecular and intermolecular collagen fibrils 
[42-46]; age-related changes, such as isomerization and glycosylation [47,48], 
resulting in isoaspartic acids and advanced glycation end products such as pen- 
tosidine [47-50]; and nitrosylation of tyrosines. The PTM processes affect the 
protein at different levels. Some processes are important for maintenance of the 
tissue, whereas others are be destructive. This important difference has been 
described in cartilage turnover [16,51], where aggrecanase-mediated degrada- 
tion aggrecan in explant cultures, measured by release of 7/*ARGV fragments, 
of cartilage was reversible. However, when the disease stage had reached MMP- 
mediated tissue destruction, measured by release of *47FFGV, cartilage repair 
was no longer possible. This exemplified that different fragments of the same 
protein may reflect different pathobiological or physiological processes | 16,52]. 


Endopeptidase Activity 


Specific proteolytic activities are a prerequisite for a range of cellular functions 
and interactions within the ECM. These specific activities are tightly coordinated 
under physiological situations in which a detailed sequence of events produces 
the adequate proteolytic response to promote tissue turnover. Examples are the 
complex proteolytic activities needed for cell migration and function [53]. This 
ECM remodeling generates a range of tissue and pathospecific turnover prod- 
ucts that may be used as molecular biochemical markers. The proteolytic action 
of endopeptidases generates specific cleavage fragments. Even though many 
components of the ECM, as well as enzymes responsible for remodeling, are 
present in different tissues, the combination of a specific peptidase and spe- 
cific ECM protein may provide a unique combination that elucidates activity 
in a particular tissue or a specific disease mechanism. The action of peptidases 
on ECM components results in matrix degradation fragments (cleavage neo- 
epitopes). Previous examples of endopeptidase activity biomarkers have been 
addressed in previous sections of this chapter. 


Crosslinking 


Crosslinking plays an important part in the ECM meshwork and thereby in tissue 
integrity. Crosslinking is especially important for the fibrillar collagens such as 
types I and III and for minor collagens as types IV—XIV. The fibrillar collagens 
are characterized by their undisturbed triple helix that is stacked head-to-tail in 
connective tissue such as bone and cartilage [19]. This constitutes the backbone 
of the tissue. It is therefore important for collagen to be able to crosslink with 
the neighboring collagen as well as other ECM components [54]. Crosslinking 
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between different ECM components or between different protein chains can 
result from enzymatic and nonenzymatic pathways. Enzymatic crosslinking 
is often processed by lysyl oxidase. When the fibrils of collagen are aligned, 
aldehydes on lysines and hydroxylysine can react with corresponding resi- 
dues forming aldol condensation products [55]. Nonenzymatic glycosylation 
of fibrillar collagen as well as minor collagens are also important crosslinking 
residues, which are increased in several pathological conditions such as diabetes 
and fibrotic diseases [45,47,56,57]. 


Nonenzymatic Glycosylation 


Nonenzymatic glycosylation, also called the Maillard reaction, is a naturally 
occurring phenomenon and leads to PTM of proteins, nucleic acids, and lip- 
ids [58]. A common cause of nonenzymatic glycosylation is increased blood 
glucose levels; accordingly, most knowledge about nonenzymatic glycosylation 
arises from studies performed in diabetics [58]. Reducing glycosylated hemo- 
globin (HbA,,) is the primary objective when treating diabetics [58]. Aging is 
also associated with accumulation of glycosylations, and recent studies high- 
lighted that these glycosylations are associated with a range of diseases that 
typically afflict older populations, including OP, OA, Alzheimer, cardiovascular 
disease, chronic heart failure, liver fibrosis, and various nephropathies [59,60]. 

Nonenzymatic glycosylation is initiated by reducing sugars such as glucose, 
fructose, and pentose through reactions with the NH) side chain containing the 
amino acids arginine, lysine, and hydroxylysine. This leads to the formation 
of a Schiff base that then is converted into an Amadori product, also called an 
early glycosylation product [61]. HbA, is a commonly used example of an 
early glycosylation product, but because of the numerous publications about the 
relevance of HbA,,, it will not be described in further detail [58]. 

The Amadori product is then degraded, resulting in an available NH, group 
on the protein and a class of highly reactive dicarbonyls such as glyoxal, methyl 
glyoxal, or 3-deoxyglucosone that are referred to as intermediate glycosylation 
products [58,61]. Methyl glyoxal has been indicated to be a very important 
intermediate glycosylation product, primarily because it is a major precursor 
for advanced glycation end products (AGEs) (see below), but also because of its 
intrinsic effects on pancreatic beta cells as well as other cells [62—64]. The most 
important role of the intermediate glycosylation products, which are reactive 
carbonyls, is their reaction with the NH, side chain—containing amino acids, 
leading to irreversible AGEs [58,61,65]. If oxidation occurs at the time of the 
reaction, a category of AGEs sometimes referred to as glycoxidation products, 
are formed. These include pentosidine and Nt-carboxymethyllysine (N®-CML), 
which represent two classes of AGEs, namely, the crosslinking and noncross- 
linking types, respectively. Pentosidine and N*-CML are the two best-described 
AGEs [59]. AGEs consist of long-lived proteins and often cause damage by 
forming irreversible crosslinks between individual proteins, ie, between the 
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fibers in the triple helix of type I collagen [66]. In addition to crosslinking, the 
AGEs also activate specific receptors that presently are categorized into either 
inflammation or clearance receptors. Several AGEs have been described in the 
literature [61]. 

A more recent area of research for pentosidine is bone biology. Because pen- 
tosidine is a crosslinking AGE, it introduces molecular bonds between two long- 
lived proteins [67] such as collagen. In bone, collagen type I is, by far, the most 
abundant molecule. It has a very long half-life, depending on bone remodeling 
rates [66], thereby rendering it a prime candidate for nonenzymatic glycosyl- 
ation. Accordingly, pentosidine crosslinks are numerous in aged bone samples 
[68]. Furthermore, accumulation of pentosidine in collagen type I is associated 
with a deterioration in the functionality of the molecules [69]. The crosslinks 
formed in type I collagen are associated with increased stiffness of the collagen 
helices; and in bone, increased stiffness is associated with increased probability 
of fracture [66,68,70]. 

Compared with age-matched healthy subjects, OP patients have been found 
to have elevated serum levels of pentosidine, indicating that pentosidine is a 
serum marker of OP [71]. A study performed in elderly Japanese women indi- 
cated that urinary pentosidine levels were predictive of fracture risk [69]. In 
contrast, the OFELY study indicated that urinary pentosidine levels do not indi- 
cate increased fracture probability in women [72]; therefore, the utility of pento- 
sidine as a biochemical marker for OP is not clear and requires further research. 

Type 1 and type 2 diabetics have poor bone quality and increased fracture 
rates [73]. Interestingly, pentosidine has been speculated to be a causal link 
between diabetes and bone fractures [73,74]. Low serum levels of AGE recep- 
tor are also associated with increased probability of fracture in type 2 diabetics 
[75]. Because studies have shown that antibodies toward AGEs can be devel- 
oped in mice [76-78], we believe these markers, combined with the correct 
enzyme-generated neoepitope, will likely provide highly accurate assays for 
monitoring different diseases in which glucose levels and resulting modifica- 
tions are involved. 


Oxidations and Hydroxylations 


Oxidative damage to proteins is often caused by the action of free radicals, 
reactive oxygen species (ROS) such as nitric oxide, or hydrogen peroxide. Oxi- 
dation has been implicated in several pathological and healthy tissue turnover 
processes. Although many amino acids can be attacked by ROS, some seem 
more likely to undergo oxidation than others. For example, lysine and proline 
are readily oxidized to aldehydes and sulfoxidation of methionine and nitrosyl- 
ation of tyrosines also occurs [57]. Under normal conditions, these ROSs are 
strictly regulated by antioxidants such as peroxidases and dismutases, among 
others [79]. However, under pathological conditions, oxidation may play an 
important role in tissue destruction. 
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Structural proteins such as collagens are primary targets for superoxide 
dismutase-catalyzed nitration by peroxynitrite. When collagen chains are disas- 
sembled due to structural changes in the tissue, eg, due to pathological change, 
the tyrosines are exposed to their surroundings and thereby become susceptible 
to nitration [80]. The reaction of nitrogen to superoxide-producing peroxinitrite 
leads to the nitration of tyrosine residues in proteins, resulting in the formation 
of the posttranslationally modified nitrotyrosine (Fig. 31.2A). Nitrotyrosines 
are negatively charged, and this can lead to further disruption of the collagen 
[80]. Beckman et al. have shown that extensive nitration takes place around 
foamy macrophages in human atherosclerotic lesions and early fatty streaks, and 
also in the foci of myocytes in the vascular smooth muscle [81]. Thus, tyrosine 
nitration may be an opportune marker of reactive nitrooxidants being produced 
by pathological processes. Although peroxynitrite is not the only nitroreactive 
element, it is the most likely source in vivo [80]. Other reactive species such as 
nitrogen dioxide or nitrate can form nitrotyrosine in simple solutions. However, 
the amounts of nitrogen dioxide or nitrite that are present in vivo are far lower 
than necessary to cause significant nitration in vitro. 

Biochemical marker assays against nitrotyrosine in type II collagen have 
been developed. In some cases of arthritis, levels of nitrosylated type II colla- 
gen are elevated, and because type III collagen is turned over at a relatively high 
rate, the level of nitrosylated type II collagen can be measured in the circula- 
tion [82]. Richardot et al. showed that this neoepitope was elevated in serum 
of patients with arthritis and that the level significantly correlated with serum 
C-reactive protein (CRP) [82]. Similarly, elevated levels of nitrosylated type 
II collagen have been detected in RA and OA patients and were also in these 
studies correlated with CRP [83,84]. Circulating nitrosylate fragments of type 
II collagen have been shown to be significantly associated with disease severity 
in OA [83,84]. 


Isomerization: Age of ECM proteins 


PTMs accumulate with age [36]. The rate at which transformations of proteins 
occur and the fragments eventually build up is controlled by the level of tissue 
turnover. In tissues with well-balanced turnover, PTMs normally contribute to 
crucial cell signaling, protection of peptides against proteolysis, and strength 
of ECM—all components of matrix integrity [36]. Nevertheless, it has become 
well acknowledged that aging of proteins has an important impact on cell inter- 
actions at the molecular level [85] and that PTMs can be involved in various 
disease pathologies [86]. As an example, autoimmune responses to a specific 
PTM (as described later) are known to be associated with the hallmarks of pro- 
tein aging [85,87,88]. Peptides that contain amino acid isomerizations are often 
resistant to proteolysis [35], and this feature affects the procession of antigens 
for presentation on the major histocompatibility complex II involved in the 
immune-response signaling for the production of T cells and antibodies [36]. In 
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preclinical studies, it has been shown that various known autoantigens contain 
sites prone to deamidation and that isomerization is involved in type I diabetes, 
RA, systemic lupus erythematosus, and experimental autoimmune encephalo- 
myelitis [35,89]. Thus, isomerization plays an important role in some common 
diseases. 

Proteins containing aspartate (D), asparagine (N), glutamate (E), or gluta- 
mine (Q) residue linked to a low-molecular-weight amino acid, such as gly- 
cine (G), can undergo a spontaneous nonenzymatic isomerization (Fig. 31.2C) 
[36]. This isomerization introduces a kink in the conformation of the molecule, 
because the peptide backbone is redirected from the a-carboxyl group in the 
native newly synthesized form to the side chain f-carboxyl [48,90]. 


Citrullination and Deiminations 


Citrullination or deimination of proteins is the reaction for converting the 
amino acid arginine into the amino acid citrulline. This reaction is catalyzed by 
enzymes called peptidylarginine deiminases [91,92]. The conversion of arginine 
into citrulline can have important consequences for the structure and function of 
proteins, because arginine is positively charged at a neutral pH, whereas citrul- 
line is uncharged. This increases the hydrophobicity of the protein, leading to 
changes in protein folding. Proteins that normally contain citrulline residues 
include myosin basic protein, fillagrin, and several histone proteins, whereas 
other proteins such as fibrinogen, fibrin, and vimentin can be citrullinated dur- 
ing cell death and tissue inflammation [93]. 

Citrullinated protein antigens are the dominant targets of the autoimmune 
response in RA [94]. The presence of antibodies to citrullinated proteins is 
detected by anticyclic citrullinated peptide (CCP) assays. These constitute a 
powerful diagnostic tool for RA, surpassing in both sensitivity and specific- 
ity the rheumatoid factor [95]. A key characteristic of anti-CCP antibodies that 
adds to their overall value is their increased prognostic capacity. Evidence high- 
lights the increased sensitivity and specificity of anti-CCP2 assays [91,92,96]. 
The test for the presence of anticitrullinated protein antibodies is highly specific 
(88-96%) and is about as sensitive as the rheumatoid factor (70-78%) in diag- 
nosing RA. They are detectable even before the onset of clinical symptoms. 

Numerous ECM remodeling-trelated diseases are considered as autoimmune 
disorders, eg, cardiovascular disease [97]. In RA, inflammation is one of the 
main clinical manifestations. The great success that citrullinated proteins have 
had in RA diagnosis and prognosis could potentially be replicated in other dis- 
eases with similar clinical manifestations. In fibrotic, cancerous, or atheroscle- 
rotic tissue, citrullinated proteins that act as a connective link triggering alone 
or in conjunction with other PTMs an immune response, could be potential 
new targets for biomarker development. The potential benefits could include the 
identification of new ECM remodeling biomarkers with increased tissue speci- 
ficity when used in conjunction with ECM degradation fragments. 
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DIFFERENT MATRICES MEAN DIFFERENT THINGS 


The measurement of the same biomarker in different matrices can generate 
very different results; thus, it is important to understand which material is being 
assessed. 


Serum Versus Plasma 


Different blood-based matrices such as serum and plasma may be chosen for 
analytical samples. Despite serum being the most common matrix used, it may 
not always be suitable, particularly if the analyst is of hemostasis origin. In this 
case, plasma is the preferred matrix. Serum specimens are obtained by allow- 
ing the whole blood to clot, during which time various amounts of proteins are 
removed into the fibrin clot; so, the remaining, nonclotted sample may not con- 
tain the analyst. Unlike serum, however, plasma contains fibrinogen together 
with other procoagulant proteins that will result in coagulation unless subjected 
to an anticoagulant. The protein concentration of serum is therefore less than 
that of plasma [98,99]; thus, considerations should always be given to the ana- 
lyst and the choice of matrices. Plasma constitutes 55% of the total blood vol- 
ume and serves as a solvent for electrolytes, nutrients, and proteins important 
for blood clotting and immunity. Plasma comprises 92% water. It is obtained 
by separating the liquid portion of the blood from the cells by the use of an 
anticoagulant. 

Numerous anticoagulants and variations of these may be employed for 
obtaining plasma samples and the most well-known are heparinate, citrate 
and ethylenediamine tetraacetic acid (EDTA). These anticoagulants inhibit 
blood coagulation by impeding different phases of the coagulation cascade. 
Considerations to the choice of anticoagulant should be given as any one 
anti-coagulant may influence the analyte outcome. This is seen with the 
variation in plasma concentrations of matrix metalloproteinases and their 
tissue inhibitors upon the use of different anticoagulants [100,101]. The 
choice of anticoagulant may be even more important if the analyst is of 
coagulation activation origin. The concentration of these analysts may vary 
depending on the used anticoagulant [102], because anticoagulants differ in 
their ability to reduce clotting. Thus, a cross-matrix comparison is always 
desirable to allow for precision and accuracy of the analyst and to avoid 
selection of an inappropriate anticoagulant. 

A general drawback with plasma is that the chemical composition upon 
long-term storage may change and fibrin clot formation may occur. In addition, 
systematic errors such as a clotted sample due to improper sampling may lead 
to inaccurate analyst determination. Many hemostasis methods display high 
sensitivity to small preanalytical variation, as has been seen with repeated sam- 
pling from the same patients [103]. In fact, preanalytical irregularities such as 
incoherent coagulation may be associated with inconsistent laboratory results, 
leading to erroneous clinical interpretation. It is estimated that 32-68% of all 
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laboratory errors occur during the preanalytical phase [104]. This is one reason 
to why serum is in general preferred. However, the choice of matrices depends 
on the analyst in question. 


Urine 


There is evidence that the levels of neoepitope markers measured in urine and 
in serum or plasma can be radically different in animal models of kidney dis- 
ease | 105] and in patients with immunoglobulin A nephropathy [106]. This may 
reflect actual pathological processes. Urine is the natural matrix of the kidney; 
therefore, information obtained from proteins excreted or secreted from the 
affected kidneys into urine could describe the disease status, whereas proteins 
released into the circulation might reflect more systemic events such as inflam- 
mation. Moreover, different organs can contribute to the pool of peptides and 
proteins that are found in circulation [107]. 

Serum and plasma have a wide dynamic range of protein contents compared 
with urine. Albumin accounts for approximately 50% of the total mass of pro- 
teins in the serum, and the most abundant 22 proteins are estimated to account 
for 99% of the serum protein mass [107]. Urine, in contrast, has a much lower 
content of proteins, at least in healthy individuals with an intact filtration barrier. 

These disparities in protein content might also be explained by the fact that 
because urine is the product of blood filtration, peptides might go through fur- 
ther processing during filtration and during their permanence in the urine, and 
this can be reflected in the measurements of samples. This is particularly true 
in the case of neoepitope markers, which are detected in urine only if the neo- 
epitope is not further degraded. At the same time new neoepitopes might arise 
after the processing of peptides and proteins in the kidney, and the marker can 
therefore be measured only after these events. 

It is also necessary to take into account that metabolites filtered in the urine 
can influence the assays used to measure the biomarkers. An example of this is 
the alteration in levels of urine creatinine measured by the creatininase-based 
reaction assay (but not by the Jaffé-based reaction assay) in urine of patients 
with alkaptonuria. Urine from these patients contains high levels of homogen- 
tisic acid | 108]. This had been previously observed in routine clinical chemistry 
assays in urine [109]. For these reasons, it is important to, if possible, measure 
the biomarkers in both serum and urine and not to automatically translate the 
results obtained in one matrix to the other. 


Synovial Fluid 


Synovial fluid (SF) is the viscous liquid in the synovial cavity and is secreted by 
the synovial membrane. Its function is to reduce friction between the articular 
cartilages of the synovial joint during movement. It is a dialysate from plasma to 
which components produced locally by joint tissue is added. The composition 
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of the SF changes during an individual’s lifetime due to changes in mechanical 
loading [110,111] and with joint degradative diseases. The main components 
of SF are water, hyaluronic acid (HA; roughly 3-4 mg/mL), p-glucuronic acid, 
p-N-acetylglucosamine, growth hormone prolactin, and lubricin [112]. Because 
SF is the fluid covering the tissues of the joint, it is closer to the cartilage and 
synovial membrane than blood. In addition, cartilage is an avascularized tissue; 
thus, the fragments of cartilage need to pass through the SF to reach blood. This 
means that measuring fragments of cartilage and synovial membrane in SF is 
a more direct measure of the tissue turnover than an assessment in blood. In 
theory, SF is preferred over blood for assessment of joint turnover, especially 
cartilage and the synovial membrane. However, the very complex and viscous 
SF matrix can easily hide peptides and crystals, which complicates their assess- 
ment. There is no gold standard pretreatment of the SF before analysis; thus, 
several methods for analyzing protein and crystal content in SF are used. Even 
though retrieval of SF is fairly easy, it is not a common procedure in standard 
care of joint diseases. 

Hyaluronidase pretreatment is the most common pretreatment. It cleaves 
the HA in SF into smaller fragments, giving a less viscous liquid that is easier 
to work with in experiments. HA is a long polymer, which very substantially 
causes the viscous properties of SF. As a consequence of loading, associations 
of polymer chains of HA occur and rheopexic properties of SF are manifested 
[113]. The production of HA reaches its peak during adolescence and thereafter 
declines with age. Ultrasonication is another pretreatment method of SF [114]. 
Another method is to purify samples before using scanning electron micro- 
scope or atomic force microscope for assessment of crystals [115]. Centrifuga- 
tion after hyaluronidase pretreatment is another method that has been used to 
assess crystals [116], and pretreatments with papain and sodium hypochlorite 
have been used to assess crystals [117,118]. Last, a method where samples were 
subjected to deglycosylation with chondroitinase ABC, and keratinase has been 
used for assessment of aggrecan peptides [119]. All of these methods have been 
individually optimized for the specific peptide or crystal to be assessed, which 
illustrates the complexity of analyzing SF. 

In summary, SF is a complex, viscous fluid that offers potential for the 
assessment of cartilage and synovial membrane turnover. However, a standard 
method for pretreatment of the fluid is needed. 


Bronchoalveolar Lavage and Sputum 


Bronchoalveolar lavage (BAL) is a method that enables the investigation of cel- 
lular and acellular components of the lower respiratory tract by administration 
of saline. BAL retrieves the secretions that coat the surface of bronchial and 
alveolar epithelium, diluted by the instilled saline. The technique allows direct 
insight into processes involved in injury of the lung tissue including inflam- 
mation, tissue degradation, and changes in the permeability of the blood—air 
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barrier. BAL is easily performed, relatively reproducible, well tolerated, and 
can even be done in acutely ill patients [120]. Compared with a lung biopsy, 
it is less invasive, it may be done repeatedly, and it enables the sampling of 
a greater lung area. In healthy individuals, a BAL sample mainly consists of 
alveolar macrophages. Changes in the cell profile may support the diagno- 
sis of a specific interstitial lung disease [120-123]. Proteins in the BAL fluid 
may be produced locally in the lung or originate from plasma, presumably 
derived from diffusion across the blood—air barrier. Permeability may increase 
with disease due to inflammatory damage, resulting in elevated plasma protein 
levels in BAL as a consequence of increased protein leakage from the blood- 
stream to the lung tissue [124,125]. Both cellular profile and alterations in 
protein expression have been demonstrated to relate to various lung diseases 
including idiopathic pulmonary fibrosis, sarcoidosis, asthma, and cystic fibro- 
sis [126-129]. Although BAL is a fairly simple matrix with a basis of saline, 
the amount collected in each sample may vary greatly due to the choice of lung 
site, quantity of instilled fluid, and type of aspiration. Thus, results should be 
expressed with a reference value such as milliliters of BAL recovered or per 
microgram of total protein to account for differences in recovery [125]. The 
considerable variability in the techniques used for the procedure as well as 
sample handling and processing may lead to great variations from lab to lab 
[123]. In 2012, the American Thoracic Society published a guideline on the 
clinical utility of BAL in interstitial lung diseases. it included a standardiza- 
tion, handling, processing, and interpretation of BAL, which hopefully will 
result in more uniformed analyses in future [120]. 

An alternative to BAL for investigating the airways directly is the collec- 
tion of induced-sputum samples. A standardized procedure for the effective col- 
lection of a sputum sample using inhalation of nebulized saline has reduced 
the variability of the method [130,131]. Sputum collection is relatively nonin- 
vasive, safe, and reproducible, and it can be collected repeatedly even during 
severe disease or exacerbations. Evaluation of airway inflammation in sputum 
has been widely used in obstructive lung diseases such as asthma and COPD 
[132,133]. Sputum processing is laborious and includes the addition of dithioth- 
reitol (DTT) to dissociate the disulfide bonds of the mucus and thus homogenize 
the sample. As with BAL, variations in the sputum processing technique may 
cause variable results [134]. Studies have indicated that sputum measurements 
may be more reproducible than BAL measurements [132,135—137]. A study 
comparing BAL, induced sputum, and blood measurements of inflammatory 
cells and proteins in mild asthmatics resulted in differences presumably related 
to measurements of different compartments of the airways, which did not nec- 
essarily correlate [138]. The authors propose that BAL samples should be used 
for investigations of the peripheral airways and sputum samples for the larger 
airways. Conversely, blood measurements reflect systemic inflammation that 
does not seem to fully reflect the disease-associated inflammatory processes 
within the airways [137]. Sputum sampling have an advantage over BAL in 
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that it shows higher cell recovery and a stronger signal of acellular components 
[138]. However, sputum samples may be contaminated by saliva that may influ- 
ence measurements of both cells and proteins [139,140]. Sample analysis using 
immunoassays may also be affected by the content of DTT as this may disrupt 
the disulfide bonds of the antibody used. 


APPLYING STRUCTURAL BIOMARKERS AS DIAGNOSTIC 
OR MONITORING TOOLS 


Bone and skeletal disease were probably the first to use biomarkers measuring 
different biomarkers of type I collagen. Bone biomarkers have been used to test 
cathepsin K inhibitors and other antiresorptives for the treatment of OP, such as 
ONO-5334 [140a], denosumab [141] and odanacatib [142]. Cathepsin K is an 
osteoclast collagenase that mediates bone resorption measurable by CTX-I or 
N-terminal telopeptide of type I collagen (NTX-I) in either serum or urine of 
patients. For example, ONO-5334 was tested in 197 postmenopausal women 
with OP or osteopenia with one fragility fracture. After 24 months, ONO-5334 
was shown to be associated with increased bone mineral density for lumbar 
spine, total hip, and femoral neck (P<.001), whereas the levels of both serum 
and urine CTX-I, and urine NTX-I as early as 2 weeks after initiation of treat- 
ment. In patients treated with ONO-5334, the bone-resorption markers urinary 
(uJ)NTX and serum and uCTX-I were significantly suppressed throughout 
24 months of treatment and to a similar extent as was seen in patients receiv- 
ing alendronate [143,144]. Another type I collagen biomarker, initial carboxyl- 
terminal telopeptide, levels were increased in patients receiving ONO-5334. 
Cathepsin K inhibition with ONO-5334 resulted in decreases in most resorption 
markers over 2 years, but did not decrease most bone formation markers. This 
was associated with an increase in radiographic bone mass density. The effect 
on biomarkers was rapidly reversible on treatment cessation. This is a recent 
example of how biomarkers of the same protein provide different information 
and can be used as early indicators of treatment efficacy. 

Another example of a type I collagen serological biomarker is C1M [22], 
which was tested in the LITHE study. This was a 1-year phase III, double-blind, 
placebo-controlled parallel group study of tocilizumab (TCZ) 4 or 8 mg/kg every 
4 weeks, in RA patients on stable doses of methotrexate (MTX) [145]. At baseline, 
CIM was significantly (P<.0001) correlated to CRP, to a visual analogue scale 
pain (P<.0001), disease activity score (DAS28-ESR) (P<.0001), joint space nar- 
rowing (JSN) (P<.01), and modified total Sharp score (mTSS) (P<.001). These 
correlations indicate that C1M is marker of disease activity [146]. In addition, 
baseline C1M was significantly correlated with change in JSN or mTSS from 
baseline to week 24 (P<.0001) and to week 52 (P<.0001). CIM levels were dose 
dependently reduced in the TCZ+MTX group. Thus, baseline C1M levels cor- 
related with worsening joint structure over 1 year. Serum C1M levels may enable 
identification of those RA patients who are in most need of aggressive treatment. 
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Type I collagen is almost solely produced by the cartilage cells, chondro- 
cytes. Great efforts have been made to develop specific biomarkers targeting 
different epitopes of type II collagen and thereby measures of type II collagen 
turnover [147,148]. Urinary CTX-II is one of most tested biomarkers and has 
been used in several preclinical and clinical studies. Serum PITANP and uri- 
nary CTX-II were measured in the CIMESTRA study that recruited early RA 
patients to investigate the relationship between markers of collagen I formation 
and degradation with disease activity measures and radiographic outcomes at a 
4-year follow-up [149]. PITANP was low at diagnosis and 4 years on (P<.001), 
irrespective of treatment and disease activity. CTX-II at baseline was increased 
(P<.001) and correlated positively with disease activity and radiographic pro- 
gression, but not with anti-CCP. These results suggest that type II collagen for- 
mation and degradation are unbalanced when RA is diagnosed. CTX-II has also 
been used as a pharmacokinetic and pharmacodynamics marker in different set- 
tings. The bioavailability and pharmacodynamics of synthetic salmon calcito- 
nin and recombinant salmon calcitonin were tested in healthy postmenopausal 
women. Measurement of bone resorption by using the CTX-I and CTX-II bio- 
markers displayed comparable responses, with areas under the curve of rela- 
tive change of serum CTX-I of —250% xhours and relative change in urine 
CTX-II of —180% xhours during the 4-h observation period. The biomarker 
data led investigators to conclude that oral synthetic and recombinant calcito- 
nin displayed comparable pharmacodynamic and kinetic properties [150]. Sev- 
eral other studies have used CTX-II to monitor and compare treatment effect in 
patients with joint diseases [151,152]. 

Urinary type II collagen neoepitope (TIINE) is another biomarker of type 
II collagen degradation. It has been used to investigate the mechanism underly- 
ing the slowing of JSN in patients with OA. Otterness et al. tested the effect 
of doxycycline in knee OA [153-155]. The mean TIINE concentration for 
doxycycline-treated patients was higher than that for the placebo group. Hellio 
Le Graverand et al. examined whether uTIINE could distinguish subjects with 
progressive radiographic and/or symptomatic knee OA from those with stable 
disease [156]. Baseline uTIINE levels were unrelated to JSN in the placebo 
group. However, among subjects in the active treatment arm, a 1 SD incre- 
ment in baseline uTIINE (68 ng/mM Cr) was associated with a marginally sig- 
nificant, two-fold increase in the odds of progression of JSN (odds ratio 2.04). 
The within-subject mean of uTIINE values at baseline, 6 months, and 12 months 
was associated with concurrent JSN measured at 16 weeks. Although baseline 
uTIINE was not a consistent predictor of JSN in subjects with knee OA, serial 
measurements of uTIINE reflect concurrent JSN. 

Other biomarkers of type II collagen turnover have been used. Anti-TNF 
therapies provide symptomatic benefit for patients with spondyloarthropathy 
(SpA). The effect of etanercept on biomarkers of type H collagen synthesis and 
degradation in patients with SpA followed for 2 years was investigated by Briot 
et al. [157,158]. Cartilage degradation was investigated by measuring serum 
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levels of the type II collagen fragments and C2C, whereas the C-terminal pro- 
peptide of type II collagen (PIICP) was used as a marker of type II collagen 
synthesis. Over 2 years, there was a significant decrease of serum C2C (P<.01). 
Compared with baseline, the decrease in serum C2C was significant at month 12 
(-12.1%; P<.01). Conversely, PIICP increased significantly by 17% (P<.01) 
from baseline to 24 months. These data suggest that etanercept may have ben- 
eficial effects on cartilage metabolism in patients with SpA. These data have 
been supported by several follow-up studies measuring the biomarkers C2M 
and ProC2 in SpA patients [31]. 

Hu et al. [159] compared apparent diffusion coefficient (ADC) values on 
diffusion-weighted imaging from hepatic fibrosis patients with those from 
healthy controls. The purpose was to identify correlation between ADC val- 
ues with serum indices of liver fibrosis. HA, laminin (LN), type II procolla- 
gen (PCIII), and collagen type IV (IV-C) were measured in 54 hepatic fibrosis 
patients and 23 healthy controls. With progressive liver fibrosis, PCIII levels 
were increased (P<.01). There were negative correlations between ADC and 
LN, PCHI, HA, and IV-C. 

During hepatic fibrosis progression, the quantity and quality of the hepatic 
ECM changes with an up to five-fold increase in total collagen content along 
with a change in the collagen profile resulting in twice the amount of type I 
collagen compared with type III collagen [160,161]. The increase in collagen 
deposition is subsequently followed by a shift in matrix composition from the 
low-density basement membrane-like matrix to an interstitial matrix containing 
fibril-forming collagens [162]. 

Regardless of etiology, cirrhosis is the end stage of progressive fibrogen- 
esis. However, the developmental pattern of fibrosis depends on the underly- 
ing etiology [163]. Hepatic stellate cells (HSCs) have long been considered the 
main fibrogenic cell type, possibly due to their ability to become isolated from 
human and rodent liver tissue. Most of the knowledge related to hepatic fibrosis 
has been based on in vitro activation of HSCs. Several fibroblast-like cell types 
contributing to fibrosis development have been identified, including septal and 
interface myofibroblasts and smooth muscle cells [164-167]. 

Nielsen et al. assessed two markers (Pro-C3 and C3M) of formation and 
degradation of type III collagen in baseline serum and stratified according to 
the baseline Ishak scores. They found the two markers had similar diagnostic 
performances as aspartate transaminase, alanine transaminase, and FibroTest. 
Interestingly, when stratifying the baseline marker levels to the changes in Ishak 
scores after 52 weeks, Pro-C3 was the only marker that significantly differenti- 
ated progressors from stable patients. Patients with high levels of Pro-C3 at 
baseline had more than four times higher odds of being progressors of fibrosis 
than those with low baseline levels [168]. 

Another example of the use of biomarkers is in the diagnosis of hernias, 
where a disturbed metabolism in the ECM contributes to an abnormal formation 
of the abdominal wall. A recent study investigated the level of ECM biomarkers 
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in patients with different types of hernia: Primary unilateral inguinal hernia, 
multiple hernias defined as three or more hernias, and incisional hernia. Bio- 
markers for synthesis of interstitial matrix (PINP, Pro-C3, Pro-C5) and base- 
ment membrane (P4NP) as well as corresponding degradation (CIM, C3M, 
C5M, and C4M) were measured in serum. In inguinal hernia patients, the turn- 
over of the interstitial matrix collagens types II (P<.05) and V (P<.001) was 
decreased compared with controls, whereas the turnover of the basement mem- 
brane IV-C was increased (P<.001). In incisional hernia patients, the turnover 
of type V collagen was decreased (P<.05) and the turnover of type IV collagen 
was increased compared with the hernia-free controls (P<.001). Hernia patients 
demonstrated systemically altered collagen metabolism. The serological turn- 
over profile of IV-Cs may predict the presence of inguinal and incisional hernia. 
The results indicated that regulation of IV-C turnover may be crucial for hernia 
development and may provide novel insight into the pathogenesis of hernia. 

These examples highlight that ECM biomarkers using the generation of 
PTMs may provide novel insights into the pathology of disease as well as act as 
diagnostic or prognostic biomarkers. 


BIPED CLASSIFICATION IN OA 


The BIPED [4,5] categorization emphasizes the different biomarker roles: bur- 
den of disease, investigative, prognostic, efficacy of intervention, and diagnos- 
tic. Here, attention is drawn to the most important parameters for clinical trials, 
by using OA as an example First, to select the study population, diagnostic and 
prognostic markers are needed. A diagnostic (or alternatively a burden of dis- 
ease) marker must ensure that the population is at the desired or well-defined 
stage of OA at baseline. A prognostic marker will likely report that the popu- 
lation has a high risk of progressing during the study period if they remain 
untreated. Finally, before final phase III clinical studies, an efficacy of interven- 
tion marker is needed to evaluate efficacy (Fig. 31.3). 

The development of diagnostic, burden of disease, or prognostic biomarkers 
requires well-defined clinical trials, often cross-sectional case-control studies. 
It is therefore important that several trivial data are collected from the subjects 
included, such as sex, age, disease severity, and treatment history, to define 
a cohort within the general population and thereby narrow the field of inter- 
est (Fig. 31.3). The purpose of the diagnostic marker in this respect would be 
to distinguish between different states (eg, diseased vs nondiseased, treated vs 
nontreated, mild vs severe disease) and isolate patients from the general pop- 
ulation. In contrast, burden of disease markers should be able to distinguish 
between severities or extent of disease within the group that were “positive” 
for the diagnostic marker. A prognostic marker can ideally predict the outcome 
from a given baseline situation of a patient by calculating the relative risk and 
odds ratio. It is consequently important that the biomarker or biomarkers chosen 
for the study are limited to those considered to be a gold standard. In OA, this 
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FIGURE 31.3 Three major classifications of the BIPED classification [4]. 


gold standard is often radiographic (eg, Kellgren—Lawrence scoring), which is 
not optimal because its accuracy is very much dependent on the observer. Both 
diagnostic and prognostic biomarkers should have high sensitivity and specific- 
ity for differentiation of diseased and nondiseased subjects. This is where many 
of the existing biomarkers tend to fail: many have shown that two cohorts have 
significantly different mean levels of a given biomarker, but failed to identify a 
relatively high percentage of individual patients as belonging to either cohort. 
In reality, a panel of markers is needed for predicting disease progression for an 
individual patient. 

When a risk group has been defined by, eg, diagnostic and prognostic mark- 
ers, a clinical trial can be taken a step further to test the effect of a potential 
treatment in randomized controlled trials. One such example was to investigate 
whether the progression of OA could be halted by treatment with bisphospho- 
nates. The correlation between the outcome and the marker data at baseline will 
define whether the marker is an efficacy of intervention marker (Fig. 31.3). 

In clinical trials, measuring biochemical markers in body fluids has typically 
several advantages over more conventional assessment tools such as radiogra- 
phy: (1) the collection of body fluid, such as blood or urine, samples is nonin- 
vasive, and can therefore be repeated over long-running studies; (2) test results 
can be achieved quickly; (3) the development of the disease can be followed 
continuously; and (4) on the basis of statistical power calculations, a relatively 
small sample size is needed to complete biomarker analysis. 

In addition, validated and qualified diagnostic and prognostic biomark- 
ers are useful tools in new drug development both in preclinical and clinical 
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phases. Ideally, the markers should enable more or less effortless identifica- 
tion of subjects with specific disease characteristics and predict the effect of 
the candidate drug, its suspected side effects, or both. Furthermore, biomark- 
ers identified for one disease might be applied to another disease, if it is 
suspected that the other disease also affects the release or representation of 
the same biomarker. 


CONCLUSION AND PERSPECTIVES 


In this book, we have highlighted the possibilities that are emerging in clini- 
cal chemistry, by the combination of multiple disease-specific neoepitopes in 
third-generation biochemical marker assays. This approach has already been 
applied to some disease areas such as bone and fibrosis, and it may be advanta- 
geous in yet other disease areas. By incorporating the most optimal biochemical 
markers in all aspects of drug discovery and development, with the promise of 
translational science, novel treatment opportunities may be identified, and drug 
research efforts may be stimulated by the ease of early detection of both efficacy 
and safety concerns. 

Biochemical markers based on the advanced disease-tissue neoepitope 
approach may be one important tool to be used in combination with others. This 
approach may bring added recognition of the value of in vitro diagnosis and 
enable the integration of better markers into more aspects of research and drug 
development. 
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